Protein disulfide isomerase and ABC transporter homologous proteins involved in the regulation of energy homeostasis

ABSTRACT

The present invention discloses three novel proteins regulating the energy homeostasis and the metabolism of triglycerides, and polynucleotides, which identify and encode the proteins disclosed in this invention. The invention also relates to vectors, host cells, antibodies, and recombinant methods for producing the polypeptides and polynucleotides of the invention. The invention also relates to the use of these sequences in the diagnosis, study, prevention, and treatment of diseases and disorders related to body-weight regulation, for example, but not limited to, metabolic diseases such as obesity, as well as related disorders such as adipositas, eating disorders, wasting syndromes (cachexia), pancreatic dysfunctions (such as diabetes mellitus), hypertension, pancreatic dysfunctions, arteriosclerosis, coronary artery disease (CAD), coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, sleep apnea, and others.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a divisional of U.S. application Ser. No. 10/473,696 filed on Sep. 29, 2003, now U.S. Pat. No. 7,211,563, which claims priority to PCT Patent Application No. PCT/EP02/03540 filed Mar. 28, 2002 and European Patent Application Nos. EP01 108 315.1 filed Apr. 2, 2001 and EP01 113 419.4 filed Jun. 1, 2001.

DESCRIPTION

This invention relates to the use of nucleic acid and amino acid sequences of protein disulfide isomerase and ABC transporter homologous proteins, and to the use of these sequences in the diagnosis, study, prevention, and treatment of diseases and disorders related to body-weight regulation, for example, but not limited to, metabolic diseases such as obesity, as well as related disorders such as adipositas, eating disorders, wasting syndromes (cachexia), pancreatic dysfunctions (such as diabetes mellitus), hypertension, arteriosclerosis, coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, sleep apnea, and others.

Obesity is one of the most prevalent metabolic disorders in the world. It is still poorly understood human disease that becomes more and more relevant for western society. Obesity is defined as an excess of body fat, frequently resulting in a significant impairment of health. Besides severe risks of illness such as diabetes, hypertension and heart disease, individuals suffering from obesity are often isolated socially. Human obesity is strongly influenced by environmental and genetic factors, whereby the environmental influence is often a hurdle for the identification of (human) obesity genes. Obesity is influenced by genetic, metabolic, biochemical, psychological, and behavioral factors. As such, it is a complex disorder that must be addressed on several fronts to achieve lasting positive clinical outcome. Obese individuals are prone to ailments including: diabetes mellitus, hypertension, coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, and sleep apnea.

Obesity is not to be considered as a single disorder but a heterogeneous group of conditions with (potential) multiple causes. Obesity is also characterized by elevated fasting plasma insulin and an exaggerated insulin response to oral glucose intake (Koltermann, J. Clin. Invest 65, 1980, 1272-1284) and a clear involvement of obesity in type 2 diabetes mellitus can be confirmed (Kopelman, Nature 404, 2000, 635-643).

Even if several candidate genes have been described which are supposed to influence the homeostatic system(s) that regulate body mass/weight, like leptin, VCPI, VCPL, or the peroxisome proliferator-activated receptor-gamma co-activator, the distinct molecular mechanisms and/or molecules influencing obesity or body weight/body mass regulations are not known.

Therefore, the technical problem underlying the present invention was to provide for means and methods for modulating (pathological) metabolic conditions influencing body-weight regulation and/or energy homeostatic circuits. The solution to said technical problem is achieved by providing the embodiments characterized in the claims.

Accordingly, the present invention relates to genes with novel functions in body-weight regulation, energy homeostasis, metabolism, and obesity. The present invention discloses specific genes involved in the regulation of body-weight, energy homeostasis, metabolism, and obesity, and thus in disorders related thereto such as eating disorder, cachexia, diabetes mellitus, hypertension, coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, and sleep apnea. The present invention describes human protein disulfide isomerase, MRP4, and ABC8 (white) genes as being involved in those conditions mentioned above.

Protein disulfide isomerase (PDI) is a protein-thiol oxidoreductase that catalyzes the folding of protein disulfides. PDI has been demonstrated to participate in the regulation of reception function, cell-cell interaction, gene expression, and actin filament polymerization. PDI has acts as a chaperone and subunit of prolyl 4-hydroxylase and microsomal triglyceride transfer protein (MTP). MTP is accelerating the transport of triglyceride, cholesteryl ester, and phospholipid between membranes (Berriot-Varoqueaux et al., 2000, Annu Rev Nutr. 20, 663-697). Mutation in MTP are the cause for abetalipoproteinemia, a hereditary disease with limited production of chlomicrons and very low-density lipoproteins (VLDL) in the intestine and liver (Rehberg et al., 1996, J Biol Chem. 271(47), 29945-52). Intracellular VLDL is associated with chaperones including PDI and glucose regulated protein 94 (GRP94, endoplasmin) and assembles with apolipoprotein B (Berriot-Varoqueaux et al., SUPRA). These chaperones are endogenous substrates of sphingosine-dependent kinases (SDKs) and regulated by signal transduction pathways (see, for example, Megidish et al., 1999, Biochemistry. 38(11), 3369-78).

The chaperones are found in the endoplasmic reticulum where the lipidation of lipoproteins like apolipoprotein B might take place (Hussain et al., 1997, Biochemistry. 36(42), 13060-7.; Wu et al., 1996, J Biol Chem. 271(17), 10227-81). In addition, the secretion of apolipoprotein B is dependent on PDI (Wang et al., 1997, J. Biol. Chem. 272(44), 27644-51).

The formation or assembly of additional proteins strongly depends on the activity of specific groups of chaperones. PDI is regulating the formation of native insulin from its precursors and the insulin degradation (Tang et al., 1988, Biochem J. 255(2), 451-5; Duckworth et al., 1998, Endocr Rev. 19(5), 608-24). Insulin signaling is crucial for the proper regulation of blood glucose levels and lipid metabolism.

Dietary energy tissue-specifically regulates endoplasmic reticulum chaperone gene expression in the liver of mice, especially glucose regulated proteins (Dhahbi et al., 1997, J Nutr. 127(9), 1758-64). Additionally PDI mRNA is strongly expressed in adipose tissue (Klaus et al., 1990, Mol Cell Endocrinol. 73(2-3), 105-10), emphasizing their important roles in metabolic pathways.

Chaperones are also essential for the cellular protection against stress in its different forms like oxidative, heat shock or hypoglycemic stress (Barnes & Smoak, 2000, Anat Embryol. 202(1), 67-74, Lee, 1992. Curr Opin Bilo. 4(2), 267-73) preventing cells to undergo apoptosis.

Furthermore, chaperones are involved in different diseases like Alzheimer's Disease (Yoo et al., 2001, Biochem Biophys Res Commun. 280(1), 249-58), Parkinson (Duan & Mattson, 1999, J Neurosci Res. 59(13), 195-206), Rheumatoid Arthritis (Corrigall et al., 2001, J Immunol. 166(3), 1492-98) or neuropsychological disease leading to suicide of patients (Bown et al., 2000, Neuropsychopharmacology. 22(3), 327-32).

ATP-binding cassette (ABC) genes encode a family of transport proteins that are known to be involved in a number of human genetic diseases. For example, polymorphisms of the human homologue of the Drosophila white gene are associated with mood and panic disorders (Nakamura et al. 1999 Mol Psychiatry. 4(2):155-62). Mutations in the canilicular multispecific organic anion transporter (cMOAT) gene could be the reason for the Dubin-Johnson syndrome leading to hyperbilirubinemia II (Wada et al. 1998, Hum Mol Genet. 7(2):203-7). The rod photoreceptor-specific ABC transporter (ABCR) is responsible for the Stargardt disease (Allikmets et al. 1997, Nat Genet. 15(3):236-46). The gene encoding ATP-binding cassette transporter 1 (ABC1) is mutated in Tangier disease leading to the absence of plasma high-density lipoprotein (HDL) and deposition of cholesteryl esters in the reticulo-endothelial system with splenomegaly and enlargement of tonsils and lymph nodes (Brooks-Wilson et al. 1999 Nat Genet. 22(4):336-45, Bodzioch et al. 1999 Nat Genet. 22(4)-347-51, Rust et al. 1999, Nat Genet. 22(4):352-5). Furthermore a subgroup of ABC transporters are involved in cellular detoxification causing multidrug resistance that counteracts e.g. anti-cancer or HIV treatment (Schuetz et al. 1999 Nat Med. 1999 (9):1048-51).

ABC transporters transport several classes of molecules. ABC1 (ABCA1), the gene mutated in Tangier disease, mediates apoAl associated export of cholesterol and phospholipids from the cell and is regulated by cholesterol efflux ((Brooks-Wilson et al. 1999 Nat Genet. 22(4):336-45, Bodzioch et al. 1999 Nat Genet. 22(4):347-51, Rust et al. 1999, Nat Genet. 22(4):352-5 Brooks-Wilson et al. 1999, Bodzioch et al. 1999, Rust et al. 1999). ABC1 is expressed on the plasma membrane and Golgi complex and the lipid export process needs vesicular budding between Golgi and plasma membrane that is disturbed in Tangier disease. LDL and HDL₃ regulate the expression of ABC1 in macrophages (Orsó et al. 2000 Nat Genet. 24:192-6).

The expression of the human homologue of the Drosophila white gene (ABC8 or ABCG1) is induced in monocyte-derived macrophages during cholesterol influx mediated by LDL and is downregulated through lipid efflux mediated by cholesterol acceptor HDL₃. ABC8 is expressed on the cell surface and intracellular compartments of cholesterol-loaded macrophages and its expression is also regulated by oxysterols that act through nuclear oxysterol receptors, liver X receptors (LXRs) and by retinoid X receptor ligand. Therefore, ABC8 activity in macrophages might be crucial for cholesterol metabolism and the development of arteriosclerosis similar to ABC1. LXR expression is regulated through PPARg gamma signalling that is essential for adipogenesis and therefore PPARg gamma signalling might regulate the expression of ABC1 and ABC8 transporters at least in macrophages (Klucken et al. 2000 Proc Natl Acad Sci USA. 97(2):817-22., Venkateswaran et al. 2000 J Biol Chem. 275(19):14700-7).

Another subgroup of ABC transporters mediates cellular detoxification and is therefore named Multidrug Resistance-associated Proteins (MRPs) or canilicular Multispecific Organic Anion Transporters (cMOATs). MRP1 has high activity towards compounds conjugated to glutathione (GSH), glucuronide or sulfate and transports sphingolipids, eicosanoids, phosphatidylcholine and phosphatidylethanolamine analogues but the main function is the cellular detoxification. MRP4 (MOAT-B) overexpression is associated with high-level resistance to the nucleoside analogues 9-(2-phosphonylmethoxyethyl)adenine and azidothymidine monophosphate, both of which are used as anti-human immunodeficiency virus (HIV) drugs (Schuetz et al. 1999 Nat Med. 5(9):1048-51). MRP4 function in lipid transport is unknown despite the fact that LDL and HDL₃ regulate its expression in macrophages.

So far, it has not been described that protein disulfide isomerase or ABC transporters and the homologous human proteins are involved in the regulation of energy homeostasis and body-weight regulation and related disorders, and thus, no functions in metabolic diseases and other diseases as listed above have been discussed. In this invention we demonstrate that the correct gene doses of protein disulphide isomerase and/or of two ABC transporter genes are essential for maintenance of energy homeostasis. A genetic screen was used to identify that the protein disulfide isomerase gene, the MRP4 gene, and/or the white (ABC8) gene cause obesity in Drosophila melanogaster, reflected by a significant increase of triglyceride content, the major energy storage substance.

Polynucleotides encoding proteins with homologies to protein disulfide isomerase or ABC transporters present the opportunity to investigate diseases and disorders, including metabolic diseases and disorders such as obesity, as well as related disorders such as described above. Molecules related to protein disulfide isomerase and ABC transporters satisfy a need in the art by providing new compositions useful in diagnosis, treatment, and prognosis of diseases and disorders, including metabolic diseases and disorders such as described above.

Before the present proteins, nucleotide sequences, and methods are described, it is understood that this invention is not limited to the particular methodology, protocols, cell lines, vectors, and reagents described as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods, devices, and materials are now described. All publications mentioned herein are incorporated herein by reference for the purpose of describing and disclosing the cell lines, vectors, and methodologies which are reported in the publications which might be used in connection with the invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure.

The present invention discloses a novel protein disulfide isomerase homologous protein and two novel ABC transporter homologous proteins regulating the energy homeostasis and the fat metabolism, especially the metabolism and storage of triglycerides, and polynucleotides, of triglycerides, and polynucleotides, which identify and encode the proteins disclosed in this invention. The invention also relates to vectors, host cells, antibodies, and recombinant methods for producing the polypeptides and polynucleotides of the invention. The invention also relates to the use of these sequences in the diagnosis, study, prevention, and treatment of diseases and disorders related to body-weight regulation, for example, but not limited to, metabolic diseases such as obesity, as well as related disorders such as adipositas, eating disorders, wasting syndromes (cachexia), pancreatic dysfunctions (such as diabetes mellitus), hypertension, arteriosclerosis, coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, sleep apnea, and others.

Protein disulfide isomerase and ABC transporter homologous proteins and nucleic acid molecules coding therefor are obtainable from insect or vertebrate species, e.g. mammals or birds. Particularly preferred are human protein disulfide isomerase and ABC transporter homologous nucleic acids, particularly nucleic acids encoding human protein disulfide isomerase homologous protein (MGC3178; Genbank Accession No. NM_(—)030810 (identical to Genbank Accession No. BC001199) for the mRNA; NP_(—)110437.1 (identical to Genbank Accession No. AAH01199) for the protein), human ATP-binding cassette, subfamily C (CFTR/MRP), member 4 (ABCC4; MRP4; Genbank Accession No. NM_(—)005845 for the mRNA; NP_(—)005836.1 for the protein), and human ATP-binding cassette, sub-family G (WHITE), member 1 (ABCG1; WHITE, Genbank Accession No. XM_(—)009777 for the mRNA; XP_(—)009777.3 for the protein). Also particularly preferred are Drosophila protein disulfide isomerase homologous and ABC transporter homologous nucleic acids and polypeptides encoded thereby. In a preferred embodiment the present invention also comprises so-called “ABC-tran” domains of the proteins and nucleic acid molecules coding therefor.

The invention particularly relates to a nucleic acid molecule encoding a polypeptide contributing to regulating the energy homeostasis and the metabolism of triglycerides, wherein said nucleic acid molecule comprises

-   (a) the nucleotide sequence shown in SEQ ID NO:1, SEQ ID NO:3, or     SEQ ID NO:5, -   (b) a nucleotide sequence which hybridizes at 66° C. in a solution     containing 0.2×SSC and 0.1% SDS to the complementary strand of a     nucleic acid molecule encoding the amino acid sequence of SEQ ID     NO:2, SEQ ID NO:4, or SEQ ID NO:6, -   (c) a sequence corresponding to the sequences of (a) or (b) within     the degeneration of the genetic code, -   (d) a sequence which encodes a polypeptide which is at least 85%,     preferably at least 90%, more preferably at least 95%, more     preferably at least 98% and up to 99.6% identical to SEQ ID NO:2,     SEQ ID NO:4, or SEQ ID NO:6, -   (e) a sequence which differs from the nucleic acid molecule of (a)     to (d) by mutation and wherein said mutation causes an alteration,     deletion, duplication or premature stop in the encoded polypeptide     or -   (f) a partial sequence of any of the nucleotide sequences of (a)     to (e) having a length of at least 15 bases, preferably at least 20     bases, more preferably at least 25 bases and most preferably at     least 50 bases.

The invention is based on the discovery that protein disulfide isomerase homologous proteins (particularly PDI-like; referred to as DevG20) and ABC transporter homologous proteins (particularly MRP4 and WHITE; herein referred to as DevG4 and DevG22, respectively) and the polynucleotides encoding these, are involved in the regulation of triglyceride storage and therefore energy homeostasis. The invention describes the use of these compositions for the diagnosis, study, prevention, or treatment of diseases and disorders related to such cells, including metabolic diseases such as obesity, as well as related disorders such as adipositas, eating disorders, wasting syndromes (cachexia), pancreatic dysfunctions (such as diabetes mellitus), hypertension, arteriosclerosis, coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, sleep apnea, and others.

To find genes with novel functions in energy homeostasis, metabolism, and obesity, a functional genetic screen was performed with the model organism Drosophila melanogaster (Meigen). One resource for screening was a proprietary Drosophila melanogaster stock collection of EP-lines. The P-vector of this collection has Gal4-UAS-binding sites fused to a basal promoter that can transcribe adjacent genomic Drosophila sequences upon binding of Gal4 to UAS-sites. This enables the EP-line collection for overexpression of endogenous flanking gene sequences. In addition, without activation of the UAS-sites, integration of the EP-element into the gene is likely to cause a reduction of gene activity, and allows determining its function by evaluating the loss-of-function phenotype.

Triglycerides are the most efficient storage for energy in cells. In order to isolate genes with a function in energy homeostasis, several thousand proprietary EP-lines were tested for their triglyceride content after a prolonged feeding period (see Examples for more detail). Lines with significantly changed triglyceride content were selected as positive candidates for further analysis.

Obese people mainly show a significant increase in the content of triglycerides. In this invention, the content of triglycerides of a pool of flies with the same genotype after feeding for six days was analyzed using a triglyceride assay, as, for example, but not for limiting the scope of the invention, is described below in the EXAMPLES section.

The invention encompasses polynucleotides that encode DevG20, DevG4, DevG22, and homologous proteins. Accordingly, any nucleic acid sequence, which encodes the amino acid sequences of DevG20, DevG4, or DevG22, can be used to generate recombinant molecules that express DevG20, DevG4, or DevG22. In a particular embodiment, the invention encompasses the nucleic acid sequence of 1693 base pairs (PDI, referred to as DevG20 SEQ ID NO:1) as shown in FIG. 4A, the nucleic acid sequence of 4487 base pairs (MRP4, referred to as DevG4, SEQ ID NO:3) as shown in FIG. 9A, and the nucleic acid sequence of 2459 base pairs (ABC8, or white, referred to as DevG22 SEQ ID NO:5) as shown in FIG. 15A. It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic code, a multitude of nucleotide sequences encoding DevG20, DevG4, or DevG22, some bearing minimal homology to the nucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the invention contemplates each and every possible variation of nucleotide sequence that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code as applied to the nucleotide sequences of naturally occurring DevG20, DevG4, or DevG22, and all such variations are to be considered as being specifically disclosed. Although nucleotide sequences which encode DevG20, DevG4, or DevG22 and variants thereof are preferably capable of hybridising to the nucleotide sequences of the naturally occurring DevG20, DevG4, or DevG22 under appropriately selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding DevG20, DevG4, or DevG22 or derivatives thereof possessing a substantially different codon usage. Codons may be selected to increase the rate at which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons are utilised by the host. Other reasons for substantially altering the nucleotide sequence encoding DevG20, DevG4, or DevG22 and derivatives thereof without altering the encoded amino acid sequences include the production of RNA transcripts having more desirable properties, such as a greater half-life, than transcripts produced from the naturally occurring sequences. The invention also encompasses production of DNA sequences, or portions thereof which encode DevG20, DevG4, or DevG22 and derivatives thereof, entirely by synthetic chemistry. After production, the synthetic sequence may be inserted into any of the many available expression vectors and cell systems using reagents that are well known in the art at the time of the filing of this application. Moreover, synthetic chemistry may be used to introduce mutations into a sequence encoding DevG20, DevG4, or DevG22 or any portion thereof.

Also encompassed by the invention are polynucleotide sequences that are capable of hybridising to the claimed nucleotide sequences, and in particular, those shown in SEQ ID NO:1, SEQ ID NO:3, and in SEQ ID NO:5, under various conditions of stringency. Hybridisation conditions are based on the melting temperature (Tm) of the nucleic acid binding complex or probe, as taught in Wahl, G. M. and S. L. Berger (1987: Methods Enzymol. 152:399-407) and Kimmel, A. R. (1987; Methods Enzymol. 152:507-511), and may be used at a defined stringency. Preferably, hybridization under stringent conditions means that after washing for 1 h with 1×SSC and 0.1% SDS at 50° C., preferably at 55° C., more preferably at 62° C. and most preferably at 68° C., particularly for 1 h in 0.2×SSC and 0.1% SDS at 50° C., preferably at 55° C., more preferably at 62° C. and most preferably at 68° C., a positive hybridization signal is observed. Altered nucleic acid sequences encoding DevG20, DevG4, or DevG22 which are encompassed by the invention include deletions, insertions, or substitutions of different nucleotides resulting in a polynucleotide that encodes the same or a functionally equivalent protein.

The encoded proteins may also contain deletions, insertions, or substitutions of amino acid residues, which produce a silent change and result in a functionally equivalent protein. Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as the biological activity of DevG20, DevG4, or DevG22 is retained. For example, negatively charged amino acids may include aspartic acid and glutamic acid; positively charged amino acids may include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values may include leucine, isoleucine, and valine; glycine and alanine; asparagine and glutamine; serine and threonine; phenylalanine and tyrosine.

Also included within the scope of the present invention are alleles of the genes encoding DevG20, DevG4, or DevG22. As used herein, an “allele” or “allelic sequence” is an alternative form of the gene, which may result from at least one mutation in the nucleic acid sequence. Alleles may result in altered mRNAs or polypeptides whose structures or function may or may not be altered. Any given gene may have none, one, or many allelic forms. Common mutational changes, which give rise to alleles, are generally ascribed to natural deletions, additions, or substitutions of nucleotides. Each of these types of changes may occur alone, or in combination with the others, one or more times in a given sequence. Methods for DNA sequencing which are well known and generally available in the art may be used to practice any embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of DNA polymerase I, SEQUENASE DNA Polymerase (US Biochemical Corp, Cleveland Ohio), Taq polymerase (Perkin Elmer), thermostable T7 polymerase (Amersham, Chicago, Ill.), or combinations of recombinant polymerases and proof-reading exonucleases such as the ELONGASE Amplification System (GIBCO/BRL, Gaithersburg, Md.). Preferably, the process is automated with machines such as the Hamilton MICROLAB 2200 (Hamilton, Reno Nev.), Peltier thermal cycler (PTC200; MJ Research, Watertown, Mass.) and the ABI 377 DNA sequencers (Perkin Elmer). The nucleic acid sequences encoding DevG20, DevG4, or DevG22 may be extended utilising a partial nucleotide sequence and employing various methods known in the art to detect upstream sequences such as promoters and regulatory elements. For example, one method which may be employed, “restriction-site” PCR, uses universal primers to retrieve unknown sequence adjacent to a known locus (Sarkar, G. (1993) PCR Methods Applic. 2:318-322). In particular, genomic DNA is first amplified in the presence of primer to linker sequence and a primer specific to the known region. The amplified sequences are then subjected to a second round of PCR with the same linker primer and another specific primer internal to the first one. Products of each round of PCR are transcribed with an appropriate RNA polymerase and sequenced using reverse transcriptase. Inverse PCR may also be used to amplify or extend sequences using divergent primers based on a known region (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186). The primers may be designed using OLIGO 4.06 primer analysis software (National Biosciences Inc., Plymouth, Minn.), or another appropriate program, to 22-30 nucleotides in length, to have a GC content of 50% or more, and to anneal to the target sequence temperatures about 68-72° C. The method uses several restriction enzymes to generate suitable fragments. The fragment is then circularised by intramolecular ligation and used as a PCR template.

Another method which may be used is capture PCR which involves PCR amplification of DNA fragments adjacent to a known sequence in human and yeast artificial chromosome DNA (Lagerstrom, M. et al. (PCR Methods Applic. 1:111-119). In this method, multiple restriction enzyme digestions and ligations also are used to place an engineered double-stranded sequence into an unknown portion of the DNA molecule before performing PCR.

Another method which may be used to retrieve unknown sequences is that of Parker, J. D. et al. (1991; Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries to walk in genomic DNA (Clontech, Palo Alto, Calif.). This process avoids the need to screen libraries and is useful in finding intron/exon junctions.

When screening for full-length cDNAs, it is preferable to use libraries that have been size-selected to include larger cDNAs. Also, random-primed libraries are preferable, in that they will contain more sequences, which contain the 5′ regions of genes. Use of a randomly primed library may be especially preferable for situations in which an oligo d(T) library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into the 5′ and 3′ non-transcribed regulatory regions. Capillary electrophoresis systems, which are commercially available, may be used to analyse the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary sequencing may employ flowable polymers for electrophoretic separation, four different fluorescent dyes (one for each nucleotide) which are laser activated, and detection of the emitted wavelengths by a charge coupled devise camera. Output/light intensity may be converted to electrical signal using appropriate software (e.g. GENOTYPER and SEQUENCE NAVIGATOR, Perkin Elmer) and the entire process from loading of samples to computer analysis and electronic data display may be computer controlled. Capillary electrophoresis is especially preferable for the sequencing of small pieces of DNA, which might be present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotide sequences which encode DevG20, DevG4, or DevG22 or fragments thereof, or fusion proteins or functional equivalents thereof, may be used in recombinant DNA molecules to direct expression of DevG20, DevG4, or DevG22 in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences, which encode substantially the same, or a functionally equivalent amino acid sequence may be produced and these sequences may be used to clone and express DevG20, DevG4, or DevG22. As will be understood by those of skill in the art, it may be advantageous to produce DevG20-encoding, DevG4-encoding, and DevG22-encoding nucleotide sequences possessing non-naturally occurring codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can be selected to increase the rate of protein expression or to produce a recombinant RNA transcript having desirable properties, such as a half-life which is longer than that of a transcript generated from the naturally occurring sequence. The nucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter DevG20, DevG4, or DevG22 encoding sequences for a variety of reasons, including but not limited to, alterations which modify the cloning, processing, and/or expression of the gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, site-directed mutagenesis may be used to insert new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, or introduce mutations, and so forth.

In another embodiment of the invention, natural, modified, or recombinant nucleic acid sequences encoding DevG20, DevG4, or DevG22 may be ligated to a heterologous sequence to encode a fusion protein. For example, to screen peptide libraries for inhibitors of DevG20, DevG4, or DevG22 activities, it may be useful to produce chimeric proteins that can be recognised by a commercially available antibodies. A fusion protein may also be engineered to contain a cleavage site located between the DevG20, DevG4, or DevG22 encoding sequence and the heterologous protein sequences, so that DevG20, DevG4, or DevG22 may be cleaved and purified away from the heterologous moiety. In another embodiment, sequences encoding DevG20, DevG4, or DevG22 may be synthesised, in whole or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. (1980) Nucl. Acids Res. Symp. Ser. 7:215-223, Horn, T. et al. (1980) Nucl. Acids Res. Symp. Ser. 7:225-232). Alternatively, the proteins themselves may be produced using chemical methods by synthesising the amino acid sequence, or a portion thereof. For example, peptide synthesis can be performed using various solid-phase techniques (Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be achieved, for example, using the ABI 431A peptide synthesiser (Perkin Elmer). The newly synthesised peptide may be substantially purified by preparative high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures and Molecular Principles, WH Freeman and Co., New York, N.Y.). The composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing (e.g., the Edman degradation procedure; Creighton, supra). Additionally, the amino acid sequences, or any part thereof, may be altered during direct synthesis and/or combined using chemical methods with sequences from other proteins, or any part thereof, to produce a variant polypeptide.

In order to express a biologically active DevG20, DevG4, or DevG22, the nucleotide sequences coding therefor or functional equivalents, may be inserted into appropriate expression vectors, i.e., a vector, which contains the necessary elements for the transcription and translation of the inserted coding sequence. Methods, which are well known to those skilled in the art, may be used to construct expression vectors and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. Such techniques are described in Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York, N.Y.

A variety of expression vector/host systems may be utilised to contain and express sequences encoding DevG20, DevG4, or DevG22. These include, but are not limited to, micro-organisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or PBR322 plasmids); or animal cell systems. The “control elements” or “regulatory sequences” are those non-translated regions of the vector-enhancers, promoters, 5′ and 3′ untranslated regions which interact with host cellular proteins to carry out transcription and translation. Such elements may vary in their strength and specificity. Depending on the vector system and host utilised, any number of suitable transcription and translation elements, including constitutive and inducible promoters, may be used. For example, when cloning in bacterial systems, inducible promoters such as the hybrid lacZ promoter of the BLUESCRIPT phagemid (Stratagene, LaJolla, Calif.) or PSPORT1 plasmid (Gibco BRL) and the like may be used. The baculovirus polyhedrin promoter may be used in insect cells. Promoters and enhancers derived from the genomes of plant cells (e.g., heat shock, RUBISCO; and storage protein genes) or from plant viruses (e.g., viral promoters and leader sequences) may be cloned into the vector. In mammalian cell systems, promoters from mammalian genes or from mammalian viruses are preferable. If it is necessary to generate a cell line that contains multiple copies of the sequences, vectors based on SV40 or EBV may be used with an appropriate selectable marker.

In bacterial systems, a number of expression vectors may be selected depending upon the use intended for the protein. For example, when large quantities of protein are needed for the induction of antibodies, vectors, which direct high level expression of fusion proteins that are readily purified, may be used. Such vectors include, but are not limited to, the multifunctional E. coli cloning and expression vectors such as the BLUESCRIPT phagemid (Stratagene), in which the sequence encoding DevG20, DevG4, or DevG22 may be ligated into the vector in frame with sequences for the amino-terminal Met and the subsequent 7 residues of β-galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. M. Schuster (1989) J. Biol. Chem. 264:5503-5509); and the like. PGEX vectors (Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion proteins with Glutathione S-Transferase (GST). In general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose beads followed by elution in the presence of free glutathione. Proteins made in such systems may be designed to include heparin, thrombin, or factor XA protease cleavage sites so that the cloned polypeptide of interest can be released from the GST moiety at will. In the yeast, Saccharomyces cerevisiae, a number of vectors containing constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may be used. For reviews, see Ausubel et al., (supra) and Grantet al. (1987) Methods Enzymol. 153:516-544.

In cases where plant expression vectors are used, the expression of sequences encoding DevG20, DevG4, or DevG22 may be driven by any of a number of promoters. For example, viral promoters such as the 35S and 19S promoters of CaMV may be used alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. Such techniques are described in a number of generally available reviews (see, for example, Hobbs, S. or Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York, N.Y.; pp, 191-196).

An insect system may also be used to express DevG20, DevG4, or DevG22. For example, in one such system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae. The sequences may be cloned into a non-essential region of the virus, such as the polyhedrin gene, and place under control of the polyhedrin promoter. Successful insertions will render the polyhedrin gene inactive and produce recombinant virus lacking coat protein. The recombinant viruses may then be used to infect, for example, S. frugiperda cells of Trichoplusia larvae in which DevG20, DevG4, or DevG22 may be expressed (Engelhard, E. K. et al. (1994) Proc. Nat. Acad. Sci. 91:3224-3227).

In mammalian host cells, a number of viral-based expression systems may be utilised. In cases where an adenovirus is used as an expression vector, sequences encoding DevG20, DevG4, or DevG22 may be ligated into an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain viable viruses which are capable of expression in infected host cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells.

Specific initiation signals may also be used to achieve more efficient translation. Such signals include the ATG initiation codon and adjacent sequences. In cases where sequences encoding DevG20, DevG4, or DevG22, initiation codons, and upstream sequences are inserted into the appropriate expression vector, no additional transcriptional or translational control signals may be needed. However, in cases where only coding sequence, or a portion thereof is inserted, exogenous translational control signals including the ATG initiation codon should be provided. Furthermore, the initiation codon should be in the correct reading frame to ensure translation of the entire insert. Exogenous translational elements and initiation codons may be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers which are appropriate for the particular cell system which is used, such as those described in the literature (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162).

In addition, a host cell strain may be chosen for its ability to modulate the expression of the inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a “prepro” form of the protein may also be used to facilitate correct insertion, folding and/or function. Different host cells such as CHO, HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery and characteristic mechanisms for such post-translational activities, may be chosen to ensure the correct modification and processing of the foreign protein.

For long-term, high-yield production of recombinant proteins, stable expression is preferred. For example, cell lines which stably express DevG20, DevG4, or DevG22 may be transformed using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector. Following the introduction of the vector, cells may be allowed to grow for 1-2 days in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells, which successfully express the introduced sequences. Resistant clones of stably transformed cells may be proliferated using tissue culture techniques appropriate to the cell type. Any number of selection systems may be used to recover transformed cell lines. These include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler, M. et al. (1977) Cell 11:223-32) and adenine phosphoribosyltransferase (Lowy, I. et al. (1980) Cell 22:817-23) genes, which can be employed in tk- or aprt-, cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can be used as the basis for selection; for example, dhfr which confers resistance to methotrexate (Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which confers resistance to the aminoglycosides neomycin and G-418 (Colbere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14) and als or pat, which confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). Additional selectable genes have been described, for example, trpB, which allows cells to utilise indole in place of tryptophan, or hisD, which allows cells to utilise histinol in place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-51). Recently, the use of visible markers has gained popularity with such markers as anthocyanins, β-glucuronidase and its substrate GUS, and luciferase and its substrate luciferin, being widely used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131).

Although the presence/absence of marker gene expression suggests that the gene of interest is also present, its presence and expression may need to be confirmed. For example, if the sequences encoding DevG20, DevG4, or DevG22 are inserted within a marker gene sequence, recombinant cells containing sequences encoding DevG20, DevG4, or DevG22 can be identified by the absence of marker gene function. Alternatively, a marker gene can be placed in tandem with sequences encoding DevG20, DevG4, or DevG22 under the control of a single promoter. Expression of the marker gene in response to induction or selection usually indicates expression of the tandem gene as well. Alternatively, host cells, which contain and express the nucleic acid sequences encoding DevG20, DevG4, or DevG22, may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA, or DNA-RNA hybridisation and protein bioassay or immunoassay techniques which include membrane, solution, or chip based technologies for the detection and/or quantification of nucleic acid or protein.

The presence of polynucleotide sequences encoding DevG20, DevG4, or DevG22 can be detected by DNA-DNA or DNA-RNA hybridisation or amplification using probes or portions or fragments of polynucleotides specific for DevG20, DevG4, or DevG22. Nucleic acid amplification based assays involve the use of oligonucleotides or oligomers based on the sequences specific for DevG20, DevG4, or DevG22 to detect transformants containing DNA or RNA encoding DevG20, DevG4, or DevG22. As used herein “oligonucleotides” or “oligomers” refer to a nucleic acid sequence of at least about 10 nucleotides and as many as about 60 nucleotides, preferably about 15 to 30 nucleotides, and more preferably about 20-25 nucleotides, which can be used as a probe or amplimer.

A variety of protocols for detecting and measuring the expression of DevG20, DevG4, or DevG22, using either polyclonal or monoclonal antibodies specific for the protein are known in the art. Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilising monoclonal antibodies reactive to two non-interfering epitopes on the protein is preferred, but a competitive binding assay may be employed. These and other assays are described, among other places, in Hampton, R. et al. (1990; Serological Methods, a Laboratory Manual, APS Press, St Paul, Minn.) and Maddox, D. E. et al. (1983; J. Exp. Med. 158:1211-1216).

A wide variety of labels and conjugation techniques are known by those skilled in the art and may be used in various nucleic acid and amino acid assays. Means for producing labelled hybridisation or PCR probes for detecting sequences related to polynucleotides encoding DevG20, DevG4, or DevG22 include oligo-labelling, nick translation, end-labelling or PCR amplification using a labelled nucleotide.

Alternatively, the sequences encoding DevG20, DevG4, or DevG22, or any portions thereof may be cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to synthesise RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labelled nucleotides. These procedures may be conducted using a variety of commercially available kits (Pharmacia & Upjohn, (Kalamazoo, Mich.); Promega (Madison Wis.); and U.S. Biochemical Corp., (Cleveland, Ohio).

Suitable reporter molecules or labels, which may be used, include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as substrates, co-factors, inhibitors, magnetic particles, and the like.

Host cells transformed with nucleotide sequences encoding DevG20, DevG4, or DevG22 may be cultured under conditions suitable for the expression and recovery of the protein from cell culture. The protein produced by a recombinant cell may be secreted or contained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors may be designed to contain signal sequences, which direct secretion of proteins through a prokaryotic or eukaryotic cell membrane. Other recombinant constructions may be used to join sequences encoding DevG20, DevG4, or DevG22 to nucleotide sequences encoding a polypeptide domain, which will facilitate purification of soluble proteins. Such purification facilitating domains include, but are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilised metals, protein A domains that allow purification on immobilised immunoglobulin, and the domain utilised in the FLAG extension/affinity purification system (Immunex Corp., Seattle, Wash.) The inclusion of cleavable linker sequences such as those specific for Factor XA or Enterokinase (Invitrogen, San Diego, Calif.) between the purification domain and the desired protein may be used to facilitate purification. One such expression vector provides for expression of a fusion protein containing DevG20, DevG4, or DevG22 and a nucleic acid encoding 6 histidine residues preceding a Thioredoxine or an Enterokinase cleavage site. The histidine residues facilitate purification on IMIAC (immobilised metal ion affinity chromatography as described in Porath, J. et al. (1992, Prot. Exp. Purif. 3: 263-281)) while the Enterokinase cleavage site provides a means for purifying DevG20, DevG4, or DevG22 from the fusion protein, A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. et al. (1993; DNA Cell Biol. 12:441-453). In addition to recombinant production, fragments of DevG20, DevG4, or DevG22 may be produced by direct peptide synthesis using solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 85:2149-2154). Protein synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, using Applied Biosystems 431A peptide synthesiser (Perkin Elmer). Various fragments of DevG20, DevG4, or DevG22 may be chemically synthesised separately and combined using chemical methods to produce the full length molecule.

DIAGNOSTIC AND THERAPEUTICS

The data disclosed in this invention show that the nucleic acids and proteins of the invention are useful in diagnostic and therapeutic applications implicated, for example but not limited to, in metabolic disorders like obesity, as well as related disorders such as adipositas, eating disorders, wasting syndromes (cachexia), pancreatic dysfunctions (such as diabetes mellitus), hypertension, pancreatic dysfunctions, arteriosclerosis, coronary artery disease (CAD), coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, sleep apnea, and other diseases and disorders. Hence, diagnostic and therapeutic uses for the DevG20, DevG4, or DevG22 proteins of the invention are, for example but not limited to, the following: (i) protein therapeutic, (ii) small molecule drug target, (iii) antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these tissues and cell types derived from these tissues).

The nucleic acids and proteins of the invention are useful in diagnositic and therapeutic applications implicated in various diseases and disorders described below and/or other pathologies and disorders. For example, but not limited to, cDNAs encoding the proteins of the invention and particularly their human homologues may be useful in gene therapy, and the DevG20, DevG4, or DevG22 proteins of the invention and particularly their human homologues may be useful when administered to a subject in need thereof. By way of non-limiting example, the compositions of the present invention will have efficacy for treatment of patients suffering from, for example, but not limited to, in metabolic disorders like obesity, as well as related disorders such as adipositas, eating disorders, wasting syndromes (cachexia), pancreatic dysfunctions (such as diabetes mellitus), hypertension, arteriosclerosis, coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, sleep apnea, and other diseases and disorders.

The nucleic acid encoding the DevG20, DevG4, or DevG22 protein of the invention, or fragments thereof, may further be useful in diagnostic applications, wherein the presence or amount of the nucleic acids or the proteins are to be assessed. These materials are further useful in the generation of antibodies that bind immunospecifically to the novel substances of the invention for use in therapeutic or diagnostic methods.

For example, in one aspect, antibodies which are specific for DevG20, DevG4, or DevG22 may be used directly as an antagonist, or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express DevG20, DevG4, or DevG22. The antibodies may be generated using methods that are well known in the art. Such antibodies may include, but are not limited to, polyclonal, monoclonal, chimerical, single chain, Fab fragments, and fragments produced by a Fab expression library. Neutralising antibodies, (i.e., those which inhibit dimer formation) are especially preferred for therapeutic use.

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, and others, may be immunised by injection with DevG20, DevG4, or DevG22 or any fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species, various adjuvants may be used to increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels such as aluminium hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and dinitrophenol. Among adjuvants used in human, BCG (Bacille Calmette-Guerin) and Corynebacterium parvum are especially preferable. It is preferred that the peptides, fragments, or oligopeptides used to induce antibodies have an amino acid sequence consisting of at least five amino acids, and more preferably at least 10 amino acids. It is preferable that they are identical to a portion of the amino acid sequence of the natural protein, and they may contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of amino acids may be fused with those of another protein such as keyhole limpet hemocyanin in order to enhance the immunogenicity.

Monoclonal antibodies to DevG20, DevG4, or DevG22 may be prepared using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique (Köhler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 81:31-42; Cote, R. J. et al. Proc. Natl. Acad. Sci. 80:2026-2030; Cole, S. P. et al. (1984) Mol. Cell Biol. 62:109-120).

In addition, techniques developed for the production of “chimeric antibodies”, the splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen specificity and biological activity can be used (Morrison, S. L. et al. (1984) Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M. S. et al (1984) Nature 312:604-608; Takeda, S. et al. (1985) Nature 314:452-454). Alternatively, techniques described for the production of single chain antibodies may be adapted, using methods known in the art, to produce single chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be generated by chain shuffling from random combinatorial immunoglobulin libraries (Burton, D. R. (1991) Proc. Natl. Acad. Sci. 88:11120-3). Antibodies may also be produced by inducing in vivo production in the lymphocyte population or by screening recombinant immunoglobulin libraries or panels of highly specific binding reagents as disclosed in the literature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. 86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299).

Antibody fragments, which contain specific binding sites for DevG20, DevG4, or DevG22, may also be generated. For example, such fragments include, but are not limited to, the F(ab′)₂ fragments which can be produced by Pepsin digestion of the antibody molecule and the Fab fragments which can be generated by reducing the disulfide bridges of F(ab′)₂ fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity (Huse, W. D. et al. (1989) Science 254:1275-1281).

Various immunoassays may be used for screening to identify antibodies having the desired specificity. Numerous protocols for competitive binding and immunoradiometric assays using either polyclonal or monoclonal antibodies with established specificities are well known in the art. Such immunoassays typically involve the measurement of complex formation between the protein and its specific antibody. A two-site, monoclonal-based immunoassay utilising monoclonal antibodies reactive to two non-interfering epitopes is preferred, but a competitive binding assay may also be employed (Maddox, supra).

In another embodiment of the invention, the polynucleotides encoding DevG20, DevG4, or DevG22, or any fragment thereof, or antisense molecules, may be used for therapeutic purposes. In one aspect, antisense molecules may be used in situations in which it would be desirable to block the transcription of the mRNA. In particular, cells may be transformed with sequences complementary to polynucleotides encoding DevG20 DevG4, or DevG22. Thus, antisense molecules may be used to modulate protein activity, or to achieve regulation of gene function. Such technology is now well know in the art, and sense or antisense oligomers or larger fragments, can be designed from various locations along the coding or control regions of sequences encoding DevG20, DevG4, or DevG22. Expression vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial plasmids may be used for delivery of nucleotide sequences to the targeted organ, tissue or cell population. Methods, which are well known to those skilled in the art, can be used to construct recombinant vectors, which will express antisense molecules complementary to the polynucleotides of the gene encoding DevG20, DevG4, or DevG22. These techniques are described both in Sambrook et al. (supra) and in Ausubel et al. (supra). Genes encoding DevG20, DevG4, or DevG22 can be turned off by transforming a cell or tissue with expression vectors which express high levels of polynucleotide or fragment thereof which encodes DevG20, DevG4, or DevG22. Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA molecules until they are disabled by endogenous nucleases. Transient expression may last for a month or more with a non-replicating vector and even longer if appropriate replication elements are part of the vector system.

As mentioned above, modifications of gene expression can be obtained by designing antisense molecules, e.g. DNA, RNA, or PNA molecules, to the control regions of the genes encoding DevG20, DevG4, or DevG22, i.e., the promoters, enhancers, and introns. Oligonucleotides derived from the transcription initiation site, e.g., between positions −10 and +10 from the start site, are preferred. Similarly, inhibition can be achieved using “triple helix” base-pairing methodology. Triple helix pairing is useful because it cause inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have been described in the literature (Gee, J. E. et al. (1994) In; Huber, B. E. and B. I. Carr, Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, N.Y.). The antisense molecules may also be designed to block translation of mRNA by preventing the transcript from binding to ribosomes.

Ribozymes, enzymatic RNA molecules, may also be used to catalyse the specific cleavage of RNA. The mechanism of ribozyme action involves sequence-specific hybridisation of the ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. Examples, which may be used, include engineered hammerhead motif ribozyme molecules that can be specifically and efficiently catalyse endonucleolytic cleavage of sequences encoding DevG20, DevG4, or DevG22. Specific ribozyme cleavage sites within any potential RNA target are initially identified by scanning the target molecule for ribozyme cleavage sites which include the following sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides corresponding to the region of the target gene containing the cleavage site may be evaluated for secondary structural features which may render the oligonucleotide inoperable. The suitability of candidate targets may also be evaluated by testing accessibility to hybridisation with complementary oligonucleotides using ribonuclease protection assays.

Antisense molecules and ribozymes of the invention may be prepared by any method known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically synthesising oligonucleotides such as solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences encoding DevG20, DevG4, or DevG22. Such DNA sequences may be incorporated into a variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesise antisense RNA constitutively or inducibly can be introduced into cell lines, cells, or tissues. RNA molecules may be modified to increase intracellular stability and half-life. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5′ and/or 3′ ends of the molecule or the use of phosphorothioate or 2′ O-methyl rather than phosphodiesterase linkages within the backbone of the molecule. This concept is inherent in the production of PNAs and can be extended in all of these molecules by the inclusion of non-traditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognised by endogenous endonucleases.

Many methods for introducing vectors into cells or tissues are available and equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken from the patient and clonally propagated for autologous transplant back into that same patient. Delivery by transfection and by liposome injections may be achieved using methods, which are well known in the art. Any of the therapeutic methods described above may be applied to any suitable subject including, for example, mammals such as dogs, cats, cows, horses, rabbits, monkeys, and most preferably, humans.

An additional embodiment of the invention relates to the administration of a pharmaceutical composition, in conjunction with a pharmaceutically acceptable carrier, for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist of DevG20, DevG4, or DevG22, antibodies to DevG20, DevG4, or DevG22, mimetics, agonists, antagonists, or inhibitors of DevG20, DevG4, or DevG22. The compositions may be administered alone or in combination with at least one other agent, such as stabilising compound, which may be administered in any sterile, biocompatible pharmaceutical carrier, including, but not limited to, saline, buffered saline, dextrose, and water. The compositions may be administered to a patient alone, or in combination with other agents, drugs or hormones. The pharmaceutical compositions utilised in this invention may be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.

In addition to the active ingredients, these pharmaceutical compositions may contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries, which facilitate processing of the active compounds into preparations which, can be used pharmaceutically, Further details on techniques for formulation and administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing Co., Easton, Pa.). Pharmaceutical compositions for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the patient.

Pharmaceutical preparations for oral use can be obtained through combination of active compounds with solid excipient, optionally grinding a resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, or sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums including Arabic and tragacanth; and proteins such as gelatine and collagen. If desired, disintegrating or solubilising agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt thereof such as sodium alginate. Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar solutions, which may also contain gum Arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coating for product identification or to characterise the quantity of active compound, i.e., dosage. Pharmaceutical preparations, which can be used orally, include push-fit capsules made of gelatine, as well as soft, sealed capsules made of gelatine and a coating, such as glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with a filler or binders, such as lactose or starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilisers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with or without stabilisers.

Pharmaceutical formulations suitable for parenteral administration may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances, which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Optionally, the suspension may also contain suitable stabilisers or agents who increase the solubility of the compounds to allow for the preparation of highly concentrated solutions.

For topical or nasal administration, penetrants appropriate to the particular barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.

The pharmaceutical compositions of the present invention may be manufactured in a manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilising processes. The pharmaceutical composition may be provided as a salt and can be formed with many acids, including but not limited to, hydrochloric, sulphuric, acetic, lactic, tartaric, malic, succinic, etc. Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free base forms. In other cases, the preferred preparation may be a lyophilised powder which may contain any or all of the following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to use. After pharmaceutical compositions have been prepared, they can be placed in an appropriate container and labelled for treatment of an indicated condition. For administration of DevG20, DevG4, or DevG22, such labelling would include amount, frequency, and method of administration.

Pharmaceutical compositions suitable for use in the invention include compositions wherein the active ingredients are contained in an effective amount to achieve the intended purpose. The determination of an effective dose is well within the capability of those skilled in the art. For any compounds, the therapeutically effective does can be estimated initially either in cell culture assays, e.g., of preadipocyte cell lines, or in animal models, usually mice, rabbits, dogs, or pigs. The animal model may also be used to determine the appropriate concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans. A therapeutically effective dose refers to that amount of active ingredient, for example DevG20, DevG4, or DevG22 or fragments thereof, antibodies of DevG20, DevG4, or DevG22, to treat a specific condition. Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., ED50 (the dose therapeutically effective in 50% of the population) and LD50 (the dose lethal to 50% of the population). The dose ratio between therapeutic and toxic effects is the therapeutic index, and it can be expressed as the ratio, LD50ED50. Pharmaceutical compositions, which exhibit large therapeutic indices, are preferred. The data obtained from cell culture assays and animal studies is used in formulating a range of dosage for human use. The dosage contained in such compositions is preferably within a range of circulating concentrations that include the ED50 with little or no toxicity. The dosage varies within this range depending upon the dosage from employed, sensitivity of the patient, and the route of administration. The exact dosage will be determined by the practitioner, in light of factors related to the subject that requires treatment. Dosage and administration are adjusted to provide sufficient levels of the active moiety or to maintain the desired effect. Factors, which may be taken into account, include the severity of the disease state, general health of the subject, age, weight, and gender of the subject, diet, time and frequency of administration, drug combination(s), reaction sensitivities, and tolerance/response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate of the particular formulation. Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total dose of about 1 g, depending upon the route of administration. Guidance as to particular dosages and methods of delivery is provided in the literature and generally available to practitioners in the art. Those skilled in the art employ different formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, conditions, locations, etc.

In another embodiment, antibodies which specifically bind DevG20, DevG4, or DevG22 may be used for the diagnosis of conditions or diseases characterised by or associated with over- or underexpression of DevG20, DevG4, or DevG22, or in assays to monitor patients being treated with DevG20, DevG4, or DevG22, agonists, antagonists or inhibitors. The antibodies useful for diagnostic purposes may be prepared in the same manner as those described above for therapeutics. Diagnostic assays include methods, which utilise the antibody and a label to detect DevG20, DevG4, or DevG22 in human body fluids or extracts of cells or tissues. The antibodies may be used with or without modification, and may be labelled by joining them, either covalently or non-covalently, with a reporter molecule. A wide variety of reporter molecules which are known in the art may be used several of which are described above.

A variety of protocols including ELISA, RIA, and FACS for measuring DevG20, DevG4, or DevG22 are known in the art and provide a basis for diagnosing altered or abnormal levels of gene expression. Normal or standard values for expression are established by combining body fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibodies to DevG20, DevG4, or DevG22 under conditions suitable for complex formation. The amount of standard complex formation may be quantified by various methods, but preferably by photometry, means. Quantities of DevG20, DevG4, or DevG22 expressed in control and disease, samples from biopsied tissues are compared with the standard values. Deviation between standard and subject values establishes the parameters for diagnosing disease.

In another embodiment of the invention, the polynucleotides specific for DevG20, DevG4, or DevG22 may be used for diagnostic purposes. The polynucieotides, which may be used, include oligonucleotide sequences, antisense RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and quantitate gene expression in biopsied tissues in which expression of DevG20, DevG4, or DevG22 may be correlated with disease. The diagnostic assay may be used to distinguish between absence, presence, and excess expression, and to monitor regulation of gene expression levels during therapeutic intervention.

In one aspect, hybridisation with PCR probes which are capable of detecting polynucleotide sequences, including genomic sequences, encoding DevG20, DevG4, or DevG22 or closely related molecules, may be used to identify nucleic acid sequences which encode DevG20, DevG4, or DevG22. The specificity of the probe, whether it is made from a highly specific region, e.g., unique nucleotides in the 5′ regulatory region, or a less specific region, e.g., especially in the 3′ coding region, and the stringency of the hybridisation or amplification (maximal, high, intermediate, or low) will determine whether the probe identifies only naturally occurring sequences encoding DevG20, DevG4, or DevG22, alleles, or related sequences. Probes may also be used for the detection of related sequences, and should preferably contain at least 50% of the nucleotides from any of the DevG20, DevG4, or DevG22 encoding sequences. The hybridisation probes of the subject invention may be DNA or RNA and derived from the nucleotide sequence of SEQ ID NO:1, SEQ ID NO:3 or SEQ ID NO:5 or from a genomic sequence including promoter, enhancer elements, and introns of the naturally occurring DevG20, DevG4, or DevG22 gene. Means for producing specific hybridisation probes for DNAs encoding DevG20, DevG4, or DevG22 include the cloning of nucleic acid sequences encoding DevG20, DevG4, or DevG22 or derivatives thereof into vectors for the production of mRNA probes. Such vectors are known in the art, commercially available, and may be used to synthesise RNA probes in vitro by means of the addition of the appropriate RNA polymerases and the appropriate labelled nucleotides. Hybridisation probes may be labelled by a variety of reporter groups, for example, radionuclides such as ³²P or ³⁵S, or enzymatic labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.

Polynucleotide sequences specific for DevG20, DevG4, or DevG22 may be used for the diagnosis of conditions or diseases, which are associated with expression of DevG20, DevG4, or DevG22. Examples of such conditions or diseases include, but are not limited to, pancreatic diseases and disorders, including diabetes. Polynucleotide sequences specific for DevG20, DevG4, or DevG22 may also be used to monitor the progress of patients receiving treatment for pancreatic diseases and disorders, including diabetes. The polynucleotide sequences may be used in Southern or Northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; or in dip stick, pin, ELISA or chip assays utilising fluids or tissues from patient biopsies to detect altered gene expression. Such qualitative or quantitative methods are well known in the art.

In a particular aspect, the nucleotide sequences encoding DevG20, DevG4, or DevG22 may be useful in assays that detect activation or induction of various metabolic diseases and disorders, including obesity, as well as related disorders such as described above. The nucleotide sequences may be labelled by standard methods, and added to a fluid or tissue sample from a patient under conditions suitable for the formation of hybridisation complexes. After a suitable incubation period, the sample is washed and the signal is quantitated and compared with a standard value. If the amount of signal in the biopsied or extracted sample is significantly altered from that of a comparable have hybridised with nucleotide sequences in the sample, and the presence of altered levels of nucleotide sequences in the sample indicates the presence of the associated disease. Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or in monitoring the treatment of an individual patient.

In order to provide a basis for the diagnosis of disease associated with expression of DevG20, DevG4, or DevG22, a normal or standard profile for expression is established. This may be accomplished by combining body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, which is specific for DevG20, DevG4, or DevG22, under conditions suitable for hybridisation or amplification.

Standard hybridisation may be quantified by comparing the values obtained from normal subjects with those from an experiment where a known amount of a substantially purified polynucleotide is used. Standard values obtained from normal samples may be compared with values obtained from samples from patients who are symptomatic for disease. Deviation between standard and subject values is used to establish the presence of disease. Once disease is established and a treatment protocol is initiated, hybridisation assays may be repeated on a regular basis to evaluate whether the level of expression in the patient begins to approximate that, which is observed in the normal patient. The results obtained from successive assays may be used to show the efficacy of treatment over a period ranging from several days to months.

With respect to metabolic diseases and disorders, including obesity, as well as related disorders such as adipositas, eating disorders, wasting syndromes (cachexia), pancreatic dysfunctions (such as diabetes mellitus), hypertension, arteriosclerosis, coronary heart disease, hypercholesterolemia, dyslipidemia, osteoarthritis, gallstones, cancer, e.g. cancers of the reproductive organs, sleep apnea, the presence of a relatively high amount of transcript in biopsied tissue from an individual may indicate a predisposition for the development of the disease, or may provide a means for detecting the disease prior to the appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ preventative measures or aggressive treatment earlier thereby preventing the development or further progression of the pancreatic diseases and disorders. Additional diagnostic uses for oligonucleotides designed from the sequences encoding DevG20, DevG4, or DevG22 may involve the use of PCR. Such oligomers may be chemically synthesised, generated enzymatically, or produced from a recombinant source. Oligomers will preferably consist of two nucleotide sequences, one with sense orientation (5′.fwdarw.3′) and another with antisense (3′.rarw.5′), employed under optimised conditions for identification of a specific gene or condition. The same two oligomers, nested sets of oligomers, or even a degenerate pool of oligomers may be employed under less stringent conditions for detection and/or quantification of closely related DNA or RNA sequences.

Methods which may also be used to quantitate the expression of DevG20, DevG4, or DevG22 include radiolabelling or biotinylating nucleotides, coamplification of a control nucleic acid, and standard curves onto which the experimental results are interpolated (Melby, P. C. et al. (1993) J. Immunol. Methods, 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 212:229-236). The speed of quantification of multiple samples may be accelerated by running the assay in an ELISA format where the oligomer of interest is presented in various dilutions and a spectrophotometric or calorimetric response gives rapid quantification.

In another embodiment of the invention, the nucleic acid sequences, which are specific for DevG20, DevG4, or DevG22, may also be used to generate hybridisation probes, which are useful for mapping the naturally occurring genomic sequence. The sequences may be mapped to a particular chromosome or to a specific region of the chromosome using well known techniques. Such techniques include FISH, FACS, or artificial chromosome constructions, such as yeast artificial chromosomes, bacterial artificial chromosomes, bacterial P1 constructions or single chromosome cDNA libraries as reviewed in Price, C. M. (1993) Blood Rev. 7:127-134, and Trask, B. J. (1991) Trends Genet. 7:149-154. FISH (as described in Verma et al. (1988) Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York, N.Y.) may be correlated with other physical chromosome mapping techniques and genetic map data. Examples of genetic map data can be found in the 1994 Genome Issue of Science (265:1981f). Correlation between the location of the gene encoding DevG20, DevG4, or DevG22 on a physical chromosomal map and a specific disease, or predisposition to a specific disease, may help to delimit the region of DNA associated with that genetic disease.

The nucleotide sequences of the subject invention may be used to detect differences in gene sequences between normal, carrier, or affected individuals. In situ hybridisation of chromosomal preparations and physical mapping techniques such as linkage analysis using established chromosomal markers may be used for extending genetic maps. Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may reveal associated markers even if the number or arm of a particular human chromosome is not known. New sequences can be assigned to chromosomal arms, or parts thereof, by physical mapping. This provides valuable information to investigators searching for disease genes using positional cloning or other gene discovery techniques. Once the disease or syndrome has been crudely localised by genetic linkage to a particular genomic region, for example, AT to 11q22-23 (Gatti, R. A. et al. (1988) Nature 336:577-580), any sequences mapping to that area may represent associated or regulatory genes for further investigation. The nucleotide sequences of the subject invention may also be used to detect differences in the chromosomal location due to translocation, inversion, etc. among normal, carrier, or affected individuals.

In another embodiment of the invention, DevG20, DevG4, or DevG22, their catalytic or immunogenic fragments or oligopeptides thereof, can be used for screening libraries of compounds, e.g. peptides or low-molecular weight organic molecules, in any of a variety of drug screening techniques. The fragment employed in such screening may be free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes, between the protein and the agent tested, may be measured.

Another technique for drug screening, which may be used, provides for high throughput screening of compounds having suitable binding affinity to the protein of interest as described in published PCT application WO84/03564. In this method, as applied to DevG20, DevG4, or DevG22 large numbers of different test compounds, e.g. small molecules, are synthesised on a solid substrate, such as plastic pins or some other surface. The test compounds are reacted with the proteins, or fragments thereof, and washed. Bound proteins are then detected by methods well known in the art. Purified proteins can also be coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, non-neutralising antibodies can be used to capture the peptide and immobilise it on a solid support. In another embodiment, one may use competitive drug screening assays in which neutralising antibodies capable of binding DevG20, DevG4, or DevG22 specifically compete with a test compound for binding DevG20, DevG4, or DevG22. In this manner, the antibodies can be used to detect the presence of any peptide, which shares one or more antigenic determinants with DevG20, DevG4, or DevG22. In additional embodiments, the nucleotide sequences which encode DevG20, DevG4, or DevG22 may be used in any molecular biology techniques that have yet to be developed, provided the new techniques rely on properties of nucleotide that are currently known, including, but not limited to, such properties as the triplet genetic code and specific base pair interactions.

The Figures show:

FIG. 1 shows the average increase of starvation resistance of HD-EP(X)10478 and HD-EP(X)31424 flies (Drosophila melanogaster; in the Figure, referred to as 10478 and 31424) by ecotopic expression using ‘FB- or elav-Gal4 driver’ (in comparison to wildtype flies (Oregon R); FB stands for fat body; elav-Gal4 stands for elevated Gal4). The average values for surviving flies (,average survivors) are given in % per time point (shown on the horizontal line as time of starvation; 8 hours (8 h) to 72 hours (72 h)) are shown. See Examples for a more detailed description.

FIG. 2 shows the increase of triglyceride content of HD-EP(X)10478 and HD-EP(X)31424 flies by ectopic expression using “FB- or elav-Gal4 driver” (in comparison to wildtype flies (Oregon R)). Standard deviation of the measurements is shown as thin bars. Triglyceride content of the fly populations is shown in ug/mg wet weight (wt) of a fly (vertical).

FIG. 3 shows the molecular organisation of the DevG20 locus.

FIG. 4A shows the nucleic acid sequence (SEQ ID NO:7) encoding the Drosophila DevG20 protein.

FIG. 4B shows the protein sequence (SEQ ID NO:8) of the Drosophila DevG20 encoded by the nucleic acid sequence shown in FIG. 4A.

FIG. 4C shows the nucleic acid sequence (SEQ ID NO:1) of the human DevG20 homolog protein encoding the Homo sapiens hypothetical protein with Genbank Accession Number NM_(—)030810.1 (MGC3178).

FIG. 4D shows the human DevG20 protein sequence (SEQ ID NO:2) (GenBank Accession Number NP_(—)110437.1) encoded by the nucleic acid sequence shown in FIG. 4C.

FIG. 5 shows the BLASTP (versus the non-redundant composite database) identity search result for Drosophila DevG20 protein (SEQ ID NO:8) and the human DevG20 protein (SEQ ID NO:2; GenBank Accession Number NP_(—)110437.1), referred to as hG20 in the Figure. The middle sequence of the alignment shows identical amino acids in the one-letter code and conserved as +. Gaps in the alignment are represented as −.

FIG. 6 shows the expression of DevG20 in mammalian tissues.

FIG. 6A shows the real-time PCR analysis of DevG20-like expression in different wildtype mouse tissues. The relative RNA-expression is shown on the left hand side, the tissues tested are given on the horizontal line (for example, pancreas (‘pancre’), white adipose tissue (‘WA’), brown adipose tissue (‘BA’), muscle (‘muscl’), liver (‘liv’), hypothalamus (‘hypothalam’), cerebellum (‘cerebell’), cortex (‘corte’), midbrain (‘midbra’); small intestine (‘s. intestine’), heart (‘hear’), lung (‘lun’), spleen (‘splee’), kidney (‘kidne’), and bone marrow (‘b. marrow’).

FIG. 6B shows the real-time PCR analysis of DevG20-like expression in different mouse models (wildtype mice (‘wt’)—bars with light grey shading; fasted mice—bars with dark grey shading, obese mice (‘ob/ob’), white bar) in different tissues (white adipose tissue (‘WA’), brown adipose tissue (‘BA’), muscle (‘musc’), liver (‘liv’), pancreas (‘pancre’), hypothalamus (‘hypothala’), cerebellum (‘cerebell’), cortex (‘corte’), midbrain (‘midbra’); small intestine (‘s. intestine’), heart (‘hea’), lung (‘lun’), spleen (‘sple’), kidney (‘kidn’), and bone marrow (‘b. marrow’).

FIG. 6C shows the real-time PCR analysis of DevG20-like expression during the differentiation of 3T3-L1 cells from pre-adipocytes to mature adipocytes. The relative RNA-expression is shown on the left hand side, the days of differention are shown on the horizontal line (d0=day 0, start of the experiment, until d10=day 10).

FIG. 7 shows the relative increase of triglyceride content of EP(2)0646, EP(2)2188, and EP(2)2517 flies caused by homozygous viable integration of the P-vector (in comparison to wildtype flies (EP-control)). Standard deviation of the measurements is shown as thin bars. Triglyceride content of the fly populations is shown as ration TG/Protein content.

FIG. 8 shows the molecular organisation of the DevG4 gene locus.

FIG. 9A shows the nucleic acid sequence (SEQ ID NO:9) encoding the Drosophila DevG4 protein.

FIG. 9B shows the Drosophila DevG4 protein sequence (SEQ ID NO:10) encoded by the mRNA shown in FIG. 9A.

FIG. 9C shows the nucleic acid sequence (SEQ ID NO:3) encoding the human DevG4 homolog (Homo sapiens ATP-binding cassette, sub-family C (CFTR/MRP), member 4, also referred to as ABCC4 and MPR4; GenBank Accession Number NM_(—)005845).

FIG. 9D shows the protein sequence (SEQ ID NO:4; GenBank Accession Number NP_(—)005836.1) of the human DevG4 homolog.

FIG. 10 shows protein domains (black boxes) of the human DevG4 protein.

FIG. 11 shows the comparison of DevG4 protein domains of different species (human ,hMRP4′, mouse (only shown in FIG. 11D, mMRP4), and Drosophila (DevG4)). Gaps in the alignment are represented as −. The alignment was produced using the multisequence alignment program of Clustal V software (Higgins, D. G. and Sharp, P. M. (1989). CABIOS, vol. 5, no. 2, 151-153.).

-   (A) Alignment of the ABC-membrane I domains. The identity of amino     acids of Drosophila DevG4 and human DevG4 (hMRP4) is 41% and the     similarity 63%. -   (B) Alignment of the ABC-tran I domains. The identity of amino acids     of Drosophila DevG4 and human DevG4 (hMRP4) is 56% and the     similarity 75%. -   (C) Alignment of the ABC-membrane II domains. The identity of amino     acids of Drosophila DevG4 and human DevG4 (hMRP4) is 42% and the     similarity 60%. -   (D) Alignment of the ABC-tran II domains. The identity of amino     acids of Drosophila DevG4 and human DevG4 (hMRP4) is 69% and the     similarity 86%. Human and mouse ABC-tran II are almost identical.

FIG. 12 shows the expression of DevG4 in mammalian tissues.

FIG. 12A shows the real-time PCR analysis of DevG4 (MRP4) expression in different wildtype mouse tissues (pancreas (‘pancre’), white adipose tissue (‘WA’), brown adipose tissue (‘BA’), muscle (‘muscl’), liver (‘liv’), hypothalamus (‘hypothalam’), cerebellum (‘cerebell’), cortex (‘corte’), midbrain (‘midbra’); small intestine (‘s. intestine’), heart (‘hear’), lung (‘fun’), spleen (‘splee’), kidney (‘kidne’), and bone marrow (‘b. marrow’). The relative RNA-expression is shown on the left hand side, the tissues tested are given on the horizontal line.

FIG. 12B shows the real-time PCR analysis of DevG4 (MRP4) expression in different mouse models (wildtype mice (‘wt’), fasted mice, obese mice (‘ob/ob’)) in different tissues (white adipose tissue (‘WA’), brown adipose tissue (‘BA’), muscle (‘muscl’), liver (‘liv’), pancreas (‘pancre’), hypothalamus (‘hypothalam’), cerebellum (‘cerebell’), cortex (‘corte’), midbrain (‘midbra’); small intestine (‘s. intestine’), heart (‘hear’), lung (‘lun’), spleen (‘splee’), kidney (‘kidne’), and bone marrow (‘b. marrow’).

FIG. 12C shows the real-time PCR analysis of DevG4 (MRP4) expression during the differentiation of 3T3-L1 cells from pre-adipocytes to mature adipocytes. The relative RNA-expression is shown on the left hand side, the days of differention are shown on the horizontal line (d0=day 0, start of the experiment, until d10 =day 10).

FIG. 13 shows the relative increase of triglyceride content of HD-EP(2)20388 and EP(2)2482 flies caused by homozygous viable integration of the P-vector (in comparison to wildtype flies (EP-control)). Standard deviation of the measurements is shown as thin bars. Triglyceride content of the fly populations is shown as ratio TG/Protein content in percent (%).

FIG. 14 shows the molecular organisation of the DevG22 gene locus.

FIG. 15A shows the nucleic acid sequence (SEQ ID NO:11) encoding the Drosophila DevG22 protein.

FIG. 15B shows the protein sequence (SEQ ID NO:12) of the Drosophila DevG22 protein.

FIG. 15C shows the nucleic acid sequence (SEQ ID NO:5) encoding the human DevG22 homolog (Homo sapiens ATP-binding cassette, sub-family G (WHITE), member 1 protein; GenBank Accession Number XM_(—)009777).

FIG. 15D shows the protein sequence (SEQ ID NO:6) of the human DevG22 homolog (Homo sapiens ATP-binding cassette, sub-family G (WHITE), member 1 protein; GenBank Accession Number XP_(—)009777.3).

FIG. 16 shows protein domain (black box) of the DevG22 protein.

FIG. 17 shows the alignment of human, mouse and fly DevG22 proteins (White-like ABC transporters), White-like ABC transporters only have a single ABC-tran protein domain. Drosophila DevG22 is 36% identical and 52% similar to human DevG22 (hWhite; ABC8, ABCG1, GenBank Accession Number XM_(—)009777). Drosophila DevG22 is 36% identical and 51% similar to mouse DevG22 (mWhite; GenBank Accession Number NP_(—)033723). Human and mouse DevG22 proteins show 95% identity and 96% similarity.

FIG. 18 shows the expression of DevG22 in mammalian tissues.

FIG. 18A shows the real-time PCR analysis of DevG22 expression in different wildtype mouse tissues (pancreas (‘pancre’), white adipose tissue (‘WA’), brown adipose tissue (‘BA’), muscle (‘muscl’), liver (‘liv’), hypothalamus (‘hypothalam’), cerebellum (‘cerebell’), cortex (‘corte’), midbrain (‘midbra’); small intestine (‘sm. testine’), heart (‘hear’), lung (‘lun’), spleen (‘splee’), kidney (‘kidne’), and bone marrow (‘b. marrow’).

FIG. 18B shows the real-time PCR analysis of DevG22 expression in different mouse models (wildtype mice (‘wt’), fasted mice, obese mice (‘ob/ob’)) in different tissues (white adipose tissue (‘WA’), brown adipose tissue (‘BA’), muscle (‘muscl’), liver (‘liv’), pancreas (‘pancre’), hypothalamus (‘hypothalam’), cerebellum (‘cerebell’), cortex (‘corte’), midbrain (‘midbra’); small intestine (‘s. intestine’), heart (‘hear’), lung (‘lun’), spleen (‘splee’), kidney (‘kidne’), and bone marrow (‘b. marrow’).

FIG. 18C shows the real-time PCR analysis of DevG22 expression during the differentiation of 3T3-L1 cells from pre-adipocytes to mature adipocytes.

EXAMPLES

A better understanding of the present invention and of its many advantages will be evident from the following examples, only given by way of illustration.

Example 1

Isolation of EP-lines That Have a Novel Function in Energy Homeostasis Using a Functional Genetic Screen

In order to isolate genes with a function in energy homeostasis several thousand EP-lines were crossed against two “Gal4-driver” lines that direct expression of Gal4 in a tissue specific manner. Two different “driver”-lines were used in the screen: (i) expressing Gal4 mainly in the fatbody (FB), (ii) expressing Gal4 in neurons (elav) (FIG. 1). After crossing the “driver”-line to the EP-line, an endogenous gene may be activated in fatbody or neurons respectively. For selection of relevant genes affecting energy homeostasis, the offspring of that cross was exposed to starvation conditions after six days of feeding. Wildtype flies show a constant starvation resistance. EP-lines with significantly changed starvation resistance were selected as positive candidates.

Example 2

HD-EP(X)10478 and HD-EP(X)31424 Flies Show Significant Starvation Resistance When Driven in the Fatbody or Neurons

Ectopic expression of the EP-lines HD-EP(X)10478 and HD-EP(X)31424, both homozygous viable integrations in the chromosomal region 10D4-10D6, under the control of the “FB-driver” and “elav-driver” caused a significant starvation resistance in comparison to wildtype flies (Oregon R, see FIG. 1). Hundred flies offspring of a cross or line were analysed under starvation conditions. Survivors per time point are shown in FIG. 1. After 24 hours of starvation HD-EP(X)10478 and 31424 flies in combination with both “drivers” show 80-100% more survivors than the wildtype Oregon R. After 48 hours of starvation, almost no wildtype flies are still alive. In contrast, after 48 hours of starvation, about 20% of the population of HD-EP(X)10478 and 31424 in combination with both “drivers” are alive which is a significant increase. Few flies of HD-EP(X)10478 and 31424 in combination with both “drivers” still survive after 72 hours of starvation where normally no wildtype flies are alive. Therefore, ectopic expression via HD-EP(X)10478 and HD-EP(X)31424 in the fatbody and neurons of Drosophila melanogaster leads to significant starvation resistance.

Example 3

Triglyceride Content is Increased by Ectopic Expression Via HD-EP(X)10478 and HD-EP(X)31424 in the Fatbody and Weaker in the Neurons

Starvation resistance can have its origin due to changes in energy homeostasis, e.g., reduction of energy consumption and/or increase in storage of substances like triglycerides. Triglycerides are the most efficient storage for energy in cells. Therefore the content of triglycerides of a pool of flies with the same genotype was analysed using an triglyceride assay.

For determination of triglyceride content of flies, several aliquots of each time ten females of HD-EP(X)10478 and HD-EP(X)31424, HD-EP(X)10478/FB-Gal4 and HD-EP(X)31424/FB-Gal4, HD-EP(X)10478/elav-Gal4 and HD-EP(X)31424/elav-Gal4 and Oregon R were analysed. Flies were incubated for 5 min at 90° C. in an aqueous buffer using a waterbath, followed by hot extraction. After another 5 min incubation at 90° C. and mild centrifugation, the triglyceride content of the flies extract was determined using Sigma Triglyceride (INT 336-10 or -20) assay by measuring changes in the optical density according to the manufacturer's protocol. As a reference fly mass was measured on a fine balance before extraction procedure.

The result of the triglyceride contents analysis is shown in FIG. 2. The average increase of triglyceride content of HD-EP(X)10478 and 31424 flies in combination with the “FB- and elav Gal4-driver” lines is shown in comparison to wildtype flies (Oregon R) and the HD-EP(X)10478 and 31424 integrations alone. Standard deviations of the measurments are shown as thin bars. Triglyceride content of the different fly populations is shown in μg/mg wet weight (wt.) of a fly. In each assay ten females of the offspring of a cross or line were analysed in the triglyceride assay after feeding that offspring for six days. The assay was repeated several times. Wildtype flies show a constant triglyceride level of 30 to 45 μg/mg wet weight of a fly. HD-EP(X)10478 and 31424 flies show a similar or slightly lower triglyceride content than wildtype. In contrast, HD-EP(X)10478 and 31424 flies in combination with both “drivers” show an average increase up to 1.8-fold of 55 to 72 μg/mg wet wt. in comparison to wildtype (Oregon R)flies and the HD-EP(X)10478 and 31424 integration alone. Therefore, gain of a gene activity in the locus 10D4-6 is responsible for changes in the metabolism of the energy storage triglycerides.

Ectopic expression of genomic Drosophila sequences using FB-Gal4 caused an average 1.8-fold increase of triglyceride content in comparison to wildtype flies (Oregon R). HD-EP(X)10478 and HD-EP(X)31424 under the control of the “elav-Gal4-driver” caused a weaker increase of triglyceride content. Therefore ectopic expression via HD-EP(X)10478 and HD-EP(X)31424 in the fatbody of Drosophila melanogaster leads to a significant increase of the energy storage triglyceride and therefore represents an obese fly model. The increase of triglyceride content by gain of a gene function suggests a gene activity in energy homeostasis in a dose dependent and tissue specific manner that controls the amount of energy stored as triglycerides.

Example 4

Measurement of Triglyceride Content of Homozygous Flies (EP(2)0646, EP(2)2188, EP(2)2517, HD-EP(2)20388, EP(2)2482)

Triglycerides are the most efficient storage for energy in cells. In order to isolate genes with a function in energy homeostasis, several thousand EP-lines were tested for their triglyceride content after a prolonged feeding period. Lines with significantly changed triglyceride content were selected as positive candidates for further analysis. In this invention, the content of triglycerides of a pool of flies with the same genotype after feeding for six days was analysed using a triglyceride assay. For determination of triglyceride content, several aliquots of each time 10 males of the offspring of a cross or line were analysed after feeding the offspring for six days. Fly mass was measured on a fine balance as a reference. Flies were extracted in methanol/chloroform (1:1) and an aliquot of the extract was evaporated under vacuum. Lipids were emulsified in an aqueous buffer with help of sonification. Triglyceride content was determined using Sigma INT 336-10 or -20 assay by measuring changes in the optical density according to the manufacturer's protocol.

Improving and simplifying the determination of triglyceride content of flies, In each assay ten males of the offspring of a cross or line were analysed in the triglyceride assay after feeding that offspring for six days; the assay was repeated several times. Flies were incubated for 5 min at 90° C. in an aqueous buffer using a waterbath, followed by hot extraction. After another 5 min incubation at 90° C. and mild centrifugation, the triglyceride content of the flies extract was determined using Sigma Triglyceride (INT 336-10 or -20) assay by measuring changes in the optical density according to the manufacturer's protocol. As a reference protein content of the same extract was measured using BIO-RAD DC Protein Assay according to the manufacturer's protocol.

Wildtype flies show constantly a triglyceride level of 11 to 23 μg/mg wet weight of a fly. EP(2)0646, EP(2)2188, EP(2)2517, and HD-EP(2)20388, and EP(2)2482 homozygous flies show constantly a higher triglyceride content than the wildtype (FIGS. 7 and 13). In contrast, EP(2)0646, EP(2)2188, EP(2)2517, HD-EP(2)20388, and EP(2)2482 flies in combination with both “drivers” show sometimes only a slightly increase (2.1- to 2.3-fold of 49 to 53 μg/mg wet wt) in comparison to the wildtype (Oregon R) (not shown). Therefore, the loss of gene activity in the loci, where the P-vector of EP(2)0646, EP(2)2188, EP(2)2517, HD-EP(2)20388, and EP(2)2482 flies is homozygous viably integrated, is responsible for changes in the metabolism of the energy storage triglycerides, therefore representing in both cases an obese fly model.

The increase of triglyceride content due to the loss of a gene function suggests potential gene activities in energy homeostasis in a dose dependent manner that controls the amount of energy stored as triglycerides.

Example 5

Identification of the Genes

DevG20 (PDI)

Nucleic acids encoding the DevG20 protein of the present invention were identified using plasmid-rescue technique. Genomic DNA sequences of about 1 kb were isolated that are localised directly 3′ to HD-EP(X)10478 or HD-EP(X)31424 integrations. Using those isolated genomic sequences public databases like Berkeley Drosophila Genome Project (GadFly) were screened thereby confirming the integration side of HD-EP(X)10478 and HD-EP(X)31424 and nearby localised endogenous genes (FIG. 3). FIG. 3 shows the molecular organisation of the DevG20 locus. Genomic DNA sequence is represented by the assembly AE003487 as a black line that includes the integration sites of EP(X)1503, HD-EP(X)10478 and HD-EP(X)31424. Numbers represent the coordinates of AE003487 genomic DNA, the predicted genes and the EP-vector integration sites. Arrows represent the direction of ectopic expression of endogenous genes controlled by the Gal4 promoters in the EP-vectors. Predicted exons of genes CG2446 and CG1837 are shown as grey bars. Using plasmid rescue method about 1 kb genomic DNA sequences that are directly localised 3′ of the HD-EP(X)10478 and HD-EP(X)31424 integration sites were isolated. Using the 1 kb plasmid rescue DNA public DNA sequence databases were screened thereby identifying the integration sites of HD-EP(X)10478 and HD-EP(X)31424.

HD-EP(X)10478 and HD-EP(X)31424 are integrated in the predicted gene CG2446 that is represented by the EST clots 241_(—)2-4 but their Gal4 promoters direct ectopic expression of endogenous genes in the opposite direction in respect to the direction of CG2446 expressions About 2 kb 3′ of HD-EP(X)10478 and HD-EP(X)31424 integration sites the predicted gene CG1837 is localized that corresponds to est clot 3553_(—)14 and could be expressed ectopically using “FB- and elav-Gal4-drivers”. The ectopic expression of CG1837 in the fatbody or weaker in neurons leads to increase of triglyceride content in flies.

HD-EP(X)10478 is inserted into the first predicted exon of CG2446 that corresponds to the EST clot 241_(—)2-4 in antisense orientation. Gal4 promoter region of HD-EP(X)10478 drives expression in the opposite direction than CG2446 is expressed therefore could drive the ectopic expression of another endogenous gene. HD-EP(X)31424 is inserted in the first predicted exon of CG2446 and its Gal4 promoter drives expression in the opposite direction in comparison to CG2446 expression. A different endogenous gene CG1837 corresponding to EST clot 3553_(—)14 is localized 2180 base pairs 3′ in sense direction of both EP-integrations. CG1837 can be expressed ectopically via HD-EP(X)10478 and HD-EP(X)31424, leading to obesity.

DevG4 (MRP4)

Nucleic acids encoding the DevG4 protein of the present invention were identified using plasmid-rescue technique. Genomic DNA sequences of about 0.8 kb were isolated that are localised directly 3′ to the EP(2)0646, EP(2)2517 and EP(2)2188 integration. Using those isolated genomic sequences public databases like Berkeley Drosophila Genome Project (GadFly) were screened thereby confirming the integration side of 0646, EP(2)2517 and EP(2)2188 and nearby localised endogenous genes (FIG. 8). FIG. 8 shows the molecular organisation of the DevG4 locus. In FIG. 8, genomic DNA sequence is represented by the assembly as a dotted black line (17.5 kb, starting at position 8256000 on chromosome 2L) that includes the integration sites of 0646, EP(2)2517 and EP(2)2188 (arrows). Numbers represent the coordinates of the genomic DNA. Arrows represent the direction of ectopic expression of endogenous genes controlled by the Gal4 promoters in the EP-vectors. Transcribed DNA sequences (ESTs and clots) are shown as bars in the lower two lines. Predicted exons of gene CG7627 (GadFly) are shown as green bars and introns as grey bars.

0646, EP(2)2517 and EP(2)2188 are integrated directly 5′ of the EST Clot 6022_(—)1 in antisense orientation. Clot 6022_(—)1 represents a cDNA clone meaning that is showing that the DNA sequence is expressed in Drosophila. Clot 6022_(—)1 sequence overlaps with the sequence of the predicted gene CG7627 therefore Clot 6022_(—)1 includes the 5′ end of DevG4 gene and EP(2)0646 and EP(2)2517 are homozygous viably integrated in the promoter of DevG4. Using the 0.8 kb plasmid rescue DNA, public DNA sequence databases were screened thereby identifying the integration sites of EP(2)0646 and EP(2)2517. It was found that EP(2)0646 and EP(2)2517 are integrated in the promoter of the gene with GadFly Accession Number CG7627 that is also represented by the EST clot 6022_(—)1. The Gal4 promoters of should direct ectopic expression of endogenous genes in the opposite direction in respect to the direction of CG7627 expression. Therefore, expression of the CG7627 could be effected by homozygous viable integration of EP(2)0646, EP(2)2517 and EP(2)2188 leading to increase of the energy storage triglycerides

DevG22

FIG. 14 shows genomic DNA sequence represented by the assembly as a dotted black line (15 kb, starting at position 171400.5 on chromosome 2L) that includes the integration site of EP(2)20388 (arrow). Numbers represent the coordinates of the genomic DNA. Arrows represent the direction of ectopic expression of endogenous genes controlled by the Gal4 promoters in the EP-vectors. Transcribed DNA sequences (ESTs and clots) are shown as green bars in another line. Predicted exons of gene with GadFly Accession Number CG17646 are shown as green bars and introns as grey bars. It was found that DevG22 encodes for a novel gene that is predicted by GadFly sequence analysis programs as CG17646. Using plasmid rescue method about 0.6 kb genomic DNA sequences that are directly localised 3′ of the EP(2)20388 integration site were isolated. Using the 0.6 kb plasmid rescue DNA, public DNA sequence databases were screened thereby identifying the integration site of EP(2)20388. EP(2)20388 is integrated directly 5′ of the EST SD03967 in sense orientation. SD03967 represents a cDNA clone meaning that its DNA sequence is expressed in Drosophila. SD03967 sequence overlaps with the 5′ sequence of the predicted gene CG17646 therefore SD03967 includes the 5′ and the 3′ end of DevG22 gene. The 3′ end of SD03967 does not overlap with CG17646 sequence therefore the cDNA of DevG22 might be even longer than shown in FIG. 14. EP(2)20388 is integrated in the promoter of the gene CG17646 that is also represented by EST SD03967; its Gal4 promoter should direct ectopic expression of CG17646. Therefore, expression of the CG17646 could be effected by homozygous viable integration of EP(2)20388 leading to increase of the energy storage triglycerides.

Example 6

Analysis of DevG20

DevG20 encodes for a novel gene that is predicted by GadFly sequence analysis programs and isolated EST clones. Neither phenotypic nor functional data are available in the prior art for the novel gene CG1837, referred to as DevG20 in the present invention. The present invention is describing the nucleic acid sequence of DevG20, as shown in FIG. 4A, SEQ ID NO:1.

The present invention is describing a polypeptide comprising the amino acid sequence of SEQ ID NO:2, as presented using the one-letter code in FIG. 4B. DevG20 is 416 amino acids in length. An open reading frame was identified by beginning with an ATP initiation codon at nucleotide 37 and ending with a CAC stop codon at nucleotide 1284 (FIG. 4B).

The predicted amino acid sequence was searched in the publicly available GenBank database. In search of sequence databases, it was found, for example, that DevG20 has 60% homology with human hGRP58 protein, a potential 58 kDa glucose regulated protein of 324 amino acids (GenBank Accession Number NP_(—)110437.1; identical to former Accession Numbers AAH01199 and BC001199) (see FIGS. 4C and 4D; SEQ ID NO:1 and 2). In particular, Drosophila DevG20 and human hGRP58 protein share 60% homology (see FIG. 5), starting between amino acid 84 and 407 of DevG20 (and amino acids 1 to 316 of hGRP58). hGRP58 protein is homologous to a mouse protein encoded by the cDNA clone 601333564F1 NCI_CGAP_Mam6, identified using tblastp sequence comparison of a protein with translated mouse EST clones.

Using InterPro protein analysis tools, it was found, for example, that the DevG20 protein has at least three Thioredoxin protein motifs and an endoplasmic reticulum target sequence. These motifs and targeting sequencing are also found in glucose-regulated proteins and Protein disulfide isomerases. Glucose regulated proteins and Protein disulfide isomerases are chaperones that are involved in many different processes like lipoprotein assembly at the endoplasmic reticulum.

DevG20 encodes for a novel protein that is homologous to the family of protein disulfide isomerases or glucose regulated proteins. Based upon homology, DevG20 protein of the invention and each homologous protein or peptide may share at least some activity.

Example 7

Analysis of DevG4

As described above, DevG4 is encoded by GadFly Accession Number CG7627. The nucleic acid sequence of Drosophila DevG4, as shown in FIG. 9A, SEQ ID NO:9. The present invention is describing a polypeptide comprising the amino acid sequence of SEQ ID NO:10, as presented using the one-letter code in FIG. 9B. Drosophila DevG4 protein is 1355 amino acids in length. An open reading frame was identified beginning with an ATP initiation codon at nucleotide 158 and ending with a stop codon at nucleotide 4225. Drosophila DevG4 has additional 28 amino acids at the N-terminus without changing the frame in comparison to the predicted CG7627 protein after combining Clot 6022_(—)1 and CG7627 cDNA sequences.

The predicted amino acid sequence was searched in the publicly available GenBank (NCBI) database. The search indicated that Drosophila DevG4 has about 40% identity with human MRP4 (MOAT-B) protein, a ATP-binding cassette (ABC) transporter protein of 1325 amino acids (Accession Number: NP_(—)005836; SEQ ID NO:10) (see FIG. 9C). In particular, Drosophila DevG4 and human homolog DevG4 (hMRP4) proteins share about 80% homology (see FIG. 9D), starting between amino acid 8 and 1330 of DevG4 (and amino acids 7 to 1277 of hMRP4).

Since the protein domains found in member of the ABC superfamily are highly conserved, a comparison (Clustal×1.8) between the four protein domains of Drosophila DevG4 with human and mouse homolog proteins was conducted (see FIG. 11). We found that human and mouse (sequence is only partially available) MRP4 as closest homologous proteins to the Drosophila DevG4 protein. Using InterPro protein analysis tools, it was found, that the DevG4 protein has at least 4 four protein motifs domains (FIG. 10). These motifs and targeting sequencing are found throughout the whole ABC transporter superfamily. ABC transporters are membrane spanning proteins that are involved in many different transport processes, FIG. 11 A shows the alignment of the ABC-membrane I domains. The identity of amino acids of Drosophila DevG4 and human hMRP4 is 41% and the similarity of the sequence is 63%. FIG. 11B shows the alignment of the ABC-tran I domains. The identity of amino acids of Drosophila DevG4 and human hMRP4 is 56% and the similarity 75%. No mouse sequence is available. FIG. 11C shows the alignment of the ABC-membrane II domains. The identity of amino acids of Drosophila DevG4 and human hMRP4 is 42% and the similarity 60%. No mouse sequence is available. FIG. 11D shows the alignment of the ABC-tran II domains. No mouse sequence is available. The identity of amino acids of Drosophila DevG4 and human hMRP4 is 69% and the similarity 86%. Human and mouse ABC-tran II domains are almost identical.

Based upon homology, Drosophila DevG4 protein and each homologous protein or peptide may share at least some activity. The DevG4 protein has two characteristic ABC-membrane domains, a six transmembrane helical region (labeled ‘ABC_membrane’ in FIG. 10, ABC transporter transmembrane region) which anchors the protein in cell membranes. In addition, DevG4 has two ABC-transporter domains of several hundred amino acid residues (labeled ‘ABC-tran’ in FIG. 10, ABC transporter), including an ATP-binding site. Proteins of the ABC family are membrane spanning proteins associated with a variety of distinct biological processes in both prokaryotes and eukaryotes, for example in transport processes such as active transport of small hydrophilic molecules across the cytoplasmic membrane. Furthermore, a single MMR-HSR1 domain (GTPase of unknown function, light grey square box in FIG. 4A) was identified in DevG4. FIG. 10 shows the has a single characteristic ABC-transporter domain (‘ABC trans’) of the DevG22 protein.

Example 8

Analyis of DevG22

As discussed above, Drosophila DevG22 protein is encoded GadFly accession number CG17646. The present invention is describing the nucleic acid sequence of DevG22, as shown in FIG. 15A, SEQ ID NO:11. The present invention is describing a polypeptide comprising the amino acid sequence of SEQ ID NO:12, as presented using the one-lefter code in FIG. 15B. Drosophila DevG22 protein is 627 amino acids in length. An open reading frame was identified beginning with an ATP initiation codon at nucleotide 576 and ending with a stop codon at nucleotide 2459.

The predicted amino acid sequence was searched in the publicly available GenBank database. In search of sequence databases, it was found, for example, that DevG22 has almost 40% identity with human White (ABC8, ABCG1) protein, a ATP-binding cassette (ABC) transporter protein of 674 amino acids (GenBank Accession Number XP_(—)009777.3; see FIGS. 15C and 15D; SEQ ID NO:5 and SEQ ID NO:6). In particular, Drosophila DevG22 and the human homolog protein share about 70% homology (see FIG. 17), starting between amino acid 57 and 622 of Drosophila DevG22 (and amino acids 27 to 507 of human DevG22-hWhite).

Using InterPro protein analysis tools, it was found that the DevG22 protein has at least 1 one protein motif (FIG. 16). The White-like subfamily of ABC transporters is characterized by the single ABC-tran domain and the overall amino acid sequence. Therefore, the complete coding sequence and not only the domains are compared, FIG. 17 shows the alignment of human, mouse, and Drosophila DevG22 proteins. Drosophila DevG22 is 36% identical and 52% similar to human DevG22 (hWhite; ABC8, ABCG1, GenBank Accession Number XP_(—)009777). Drosophila DevG22 is 36% identical and 51% similar to mouse DevG22 (mWhite; GenBank Accession Number NP_(—)033723). Therefore, the vertebrate white transporter is the closest homologue to Drosophila DevG22. Human and mouse White proteins show 95% identity and 96% similarity. Based upon homology, DevG22 protein of the invention and each homologous protein or peptide may share at least some activity.

Example 9

Expression of the Polypeptides in Mammalian Tissues

For analyzing the expression of the polypeptides disclosed in this invention in mammalian tissues, several mouse strains (preferrably mice strains C57BI/6J, C57BI/6 ob/ob and C57BI/KS db/db which are standard model systems in obesity and diabetes research) were purchased from Harlan Winkelmann (33178 Borchen, Germany) and maintained under constant temperature (preferrably 22° C.), 40 percent humidity and a light/dark cycle of preferrably 14/10 hours. The mice were fed a standard chow (for example, from ssniff Spezialitäten GmbH, order number ssniff M-Z V1126-000). Animals were sacrificed at an age of 6 to 8 weeks. The animal tissues were isolated according to standard procedures known to those skilled in the art, snap frozen in liquid nitrogen and stored at −80° C. until needed.

For analyzing the role of the proteins disclosed in this invention in the in vitro differentiation of different mammalian cell culture cells for the conversion of pre-adipocytes to adipocytes, mammalian fibroblast (3T3-L1) cells (e.g., Green & Kehinde, Cell 1: 113-116, 1974) were obtained from the American Tissue Culture Collection (ATCC, Hanassas, Va., USA; ATCC-CL 173). 3T3-L1 cells were maintained as fibroblasts and differentiated into adipocytes as described in the prior art (erg., Qiu. et al., J. Biol. Chem. 276:11988-95, 2001; Slieker et al., BBRC 251: 225-9, 1998). At various time points of the differentiation procedure, beginning with day 0 (day of confluence) and day 2 (hormone addition; for example, dexamethason and 3-isobutyl-1-methylxanthin), up to 10 days of differentiation, suitable aliquots of cells were taken every two days. Alternatively, mammalian fibroblast 3T3-F442A cells (e.g., Green & Kehinde, Cell 7: 105-113, 1976) were obtained from the Harvard Medical School, Department of Cell Biology (Boston, Mass., USA). 3T3-F442A cells were maintained as fibroblasts and differentiated into adipocytes as described previously (Djian, P. et al., J. Cell. Physiol., 124:554-556, 1985). At various time points of the differentiation procedure, beginning with day 0 (day of confluence and hormone addition, for example, Insulin), up to 10 days of differentiation, suitable aliquots of cells were taken every two days. 3T3-F442A cells are differentiating in vitro already in the confluent stage after hormone (insulin) addition.

RNA was isolated from mouse tissues or cell culture cells using Trizol Reagent (for example, from Invitrogen, Karlsruhe, Germany) and further purified with the RNeasy Kit (for example, from Qiagen, Germany) in combination with an DNase-treatment according to the instructions of the manufacturers and as known to those skilled in the art. Total RNA was reverse transcribed (preferrably using Superscript II RNaseH⁻ Reverse Transcriptase, from Invitrogen, Karlsruhe, Germany) and subjected to Taqman analysis preferrably using the Taqman 2×PCR Master Mix (from Applied Biosystems, Weiterstadt, Germany; the Mix contains according to the Manufacturer for example AmpliTaq Gold DNA Polymerase, AmpErase UNG, dNTPs with dUTP, passive reference Rox and optimized buffer components) on a GeneAmp 5700 Sequence Detection System (from Applied Biosystems, Weiterstadt, Germany).

For the analysis of the expression of DevG20, taqman analysis was performed using the following primer/probe pair (see FIG. 6): Mouse DevG20 (PDI) forward primer (SEQ ID NO:13): 5′-CAC GGG TGA CAA GGG CA-3′; mouse DevG20 (PDI) reverse primer (SEQ ID NO:14): 5′-CCC CTG TGC AAT AGT GTC CTC-3′; Taqman probe (SEQ ID NO:15): (5/6-FAM) TGC TGG CAC TCA CCG AGA AGA GCT T (5/6-TAMRA).

As shown in FIG. 6A, real time PCR (Taqman) analysis of the expression of DevG20 protein in mammalian (mouse) tissues revealed that DevG20 (PDI) is rather ubiquitously expressed in various mouse tissues, However, a clear expression in WAT and BAT can also be demonstrated. DevG20 (PDI) shows an up-regulation of its expression in BAT, cortex and spleen of genetically obese ob/ob mice (FIG. 6B). In addition, its expression in kidney and bone marrow of fasted mice is also up-regulated. Even though no up-regulation of DevG20 (PDI) expression in WAT of ob/ob mice has been observed, we can clearly demonstrate a two-fold up-regulation of its expression during the in vitro differentiation of 3T3-L1 cells from pre-adipocytes to mature adipocytes (FIG. 6C).

For the analysis of the expression of DevG4, taqman analysis was performed using the following primer/probe pair (see FIG. 12); Mouse DevG4 (mrp4) forward primer (SEQ ID NO:16): 5′-CAA GTA GCG CCC ACC CC-3′; Mouse DevG4 (mrp4) reverse primer (SEQ ID NO:17): 5′-AGT TCA CAT TGT CGA AGA CGA TGA-3′; Taqman probe (SEQ ID NO:18): (5/6-FAM) AGG CTG GCC CCA CGA GGG A (5/6-TAMRA).

Taqman analysis revealed that DevG4 (mrp4) is ubiquitously expressed in various mouse tissues with highest levels of expression found in kidney (FIG. 12A). DevG4 (mrp4) shows a very prominent up-regulation of its expression in liver of genetically obese ob/ob mice (FIG. 12B). In addition, a significant up-regulation in kidney can also be observed under these conditions. Under fasting conditions, DevG4 (mrp4) expression seems to show a global down-regulation of its expression, this is especially prominent in the BAT tissue of fasting mice. DevG4 (mrp4) expression increases approximately 4-fold during the in vitro differentiation of 3T3-L1 cells from pre-adipocytes to mature adipocytes (FIG. 12C).

For the analysis of the expression of DevG22, taqman analysis was performed using the following primer/probe pair: Mouse DevG22 (white) forward primer (SEQ ID NO:18): 5′-TCG TAT ACT GGA TGA CGT CCC A-3′; Mouse DevG22 (white) reverse primer (SEQ ID NO:19): 5′-TGG TAC CCA GAG CAG CGA AC-3′; Taqman probe (SEQ ID NO:20): (5/6-FAM) CCG TCG GAC GCT GTG CGT TTT (5/6-TAMRA).

Taqman analysis revealed that DevG22 (white) is predominantly expressed in neuronal tissues. However, a clear expression in other tissues like WAT or BAT has also been noted (FIG. 18A and 18B). The expression of DevG22 (white) in BAT and WAT is under metabolic control: In fasted mice, expression goes up in BAT. Contrary to this, expression is increased in WAT and muscle in genetically obese ob/ob mice (FIG. 18B). This up-regulation in ob/ob mice correlates with the observed strong up-regulation of DevG22 (white) expression during the in vitro differentiation of 3T3-L1 cells (FIG. 18C).

All publications and patents mentioned in the above specification are herein incorporated by reference.

Various modifications and variations of the described method and system of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention which are obvious to those skilled in molecular biology or related fields are intended to be within the scope of the following claims.

LITERATURE

Klucken J, Büchler C, Orsó E, Kaminski W E, Porsch-Özcürümez M, Liebisch G, Kapinsky M, Diederich W, Drobnik W, Dean M, Allikmets R, Schmitz G.: ABCG1 (ABC8), the human homolog of the Drosophila white gene, is a regulator of macrophage cholesterol and phospholipid transport. Proc Natl Acad Sci USA. 2000 Jan; 97(2):817-22.

Orsó E, Broccardo C, Kaminski W E, Böttcher A, Liebisch G, Drobnik W, Götz A, Chambenoit O, Diederich W, Langmann T, Spruss T, Luciani M-F, Rothe G, Lackner K J, Chimini G, Schmitz G.: Transport of lipids from Golgi to plasma membrane is defective in Tangier disease patients and Abc1-deficient mice. Nat Genet. 2000 Feb; 24:192-6.

Venkateswaran A, Repa J J, Lobaccaro J-MA, Bronson A, Mangelsdorf D J, Edwards P A.: Human White/murine ABC8 mRNA levels are highly induced in lipid-loaded macrophages. J Biol Chem. 2000 May; 275(19):14700-7.

Nakamura M, Ueno S, Sano A, Tanabe H.: Polymorphisms of the human homologue of the Drosophila white gene are associated with mood and panic disorders. Mol Psychiatry. 1999 Mar; 4(2):155-62.

Bodzioch M, Orso E, Klucken J, Langmann T. Bottcher A, Diederich W, Drobnik W, Barlage S, Buchler C, Porsch-Ozcurumez M, Kaminski W E, Hahmann H W, Oette K, Rothe S, Aslanidis C, Lackner K J, Schmitz G.: The gene encoding ATP-binding cassette transporter 1 is mutated in Tangier disease. Nat Genet. 1999 Aug; 22(4):347-51.

Brooks-Wilson A, Marcil M, Clee S M, Zhang L H, Roomp K, van Dam M, Yu L, Brewer C, Collins J A, Molhuizen H O, Loubser O, Ouelette B F, Fichter K, Ashbourne-Excoffon K J, Sensen C W, Scherer S, Mott S, Denis M, Martindale D, Frohlich J, Morgan K, Koop B, Pimstone S, Kastelein J J, Hayden M R, et al.: Mutations in ABC1 in Tangier disease and familial high-density lipoprotein deficiency. Nat Genet. 1999 Aug; 22(4):336-45.

Rust S, Rosier M, Funke H, Real J, Amoura Z, Piette J C, Deleuze J F, Brewer H B, Duverger N, Denefle P, Assmann G.: Tangier disease is caused by mutations in the gene encoding ATP-binding cassette transporter 1. Nat Genet. 1999 Aug; 22(4):352-5.

Schuetz J D, Connelly M C, Sun D, Paibir S D, Flynn P M, Srinivas R V, Kumar A, Fridland A.: MRP4. A previously unidentified factor in resistance to nucleoside-based antiviral drugs. Nat Med. 1999 Sep; 5(9):1048-51.

Wada M, Toh S, Taniguchi K, Nakamura T, Uchiumi T, Kohno K, Yoshida I, Kimura A, Sakisaka S, Adachi Y, Kuwano M.: Mutations in the canilicular organic anion transporter (cMOAT) gene, a novel ABC transporter, in patients with hyperbilirubinemia II/Dubin-Johnson syndrome. Hum Mol Genet. 1998 Feb; 7(2):203-7.

Allikmets R, Singh N, Sun H, Shroyer N F, Hutchinson A, Chidambaram A, Gerrard B, Baird L, Stauffer D, Peiffer A, Rattner A, Smallwood P, Li Y, Anderson K L, Lewis R A, Nathans J, Leppert M, Dean M, Lupski J R.: A photoreceptor cell-specific ATP-binding transporter gene (ABCR) is mutated in recessive Stargardt macular dystrophy. Nat Genet. 1997 Mar; 15(3):236-46. 

1. A composition comprising an isolated ABC transporter DevG20 polypeptide encoded by a polynucleotide sequence as defined in SEQ ID NO:1, and an acceptable carrier, wherein said ABC transporter DevG20 polypeptide regulates energy homeostasis and metabolism of triglycerides.
 2. The composition of claim 1, wherein the ABC transporter DevG20 polypeptide consisting of an amino acid sequence set forth in SEQ ID NO:2.
 3. The composition of claim 1, wherein the ABC transporter DevG20 polypeptide is a recombinant polypeptide.
 4. The composition of claim 1, wherein the ABC transporter DevG20 polypeptide is a recombinant fusion polypeptide.
 5. The composition of claim 1, wherein the composition is a diagnostic composition. 